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CN 2H-Indol-2-one, 3-[ (3, 5-dimethyl-lH-pyrrol-2-yl) methylene] -1, 3-dihydro-5- 
methyl-, (3Z)- (9CI) (CA INDEX NAME) 



Double bond geometry as shown. 
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AB The present invention relates to the 3-dimensional structures of a 
protein 

tyrosine kinase optionally complexed with one or more compds. Thus, a 
310-amino acid fragment fibroblast growth factor receptor 1 (residues 
456-765, FGFR1) was recombinant ly prepd. contg. the amino acid 
substitutions Cys488 . fwdarw. Ala, Cys584.fwdarw.Ser, and 
Leu 4 57 . f wdarw . Val , 

and an addnl. 5 residues (Ser-Ala-Ala-Gly-Thr) at the N-terminus. -X-ray 
crystallog. yielded the at. structural coordinates of cryst. FGFR1 and 

complexes with adenylyl diphosphonate, 3- [ (3- (2-carboxyethyl) -4- 
methylpyrrol-5-yl)methylene]-2-indolinone, or 3- [4- (4-f ormylpiperazine-1- 
yl)benzylidenyl]-2-indolinone. Two forms of cryst. FGFR1 were obtained- 
one form (designated C2-A form) with unit cell dimensions of a - 208 3 *b 
- 57.2, c = 65.5.ANG. and .beta. = 107. 2. degree. , and another C2-B form 
with dimensions a = 211.6, b - 51.3, c = 66.1.ANG. and .beta. - 
107. 7. degree. . The overall structure of FGFR1 is bi-lobate. The 
N-terminal lobe of FGFR1 spans amino acid residues 4 56-567 and comprises 

a 

curled .beta. -sheet of five antiparallel strands and one . alpha . -helix 
The C-terminal lobe spans amino acid residues 568-765 and comprises two 
.beta. -strands and seven .alpha. -helixes. The at. coordinates that 
define 

the structures of the protein tyrosine kinase and any of the compds. 

bound 

to it are pertinent to methods for detg. the 3-dimensional structures of 
protein tyrosine kinases with unknown structure and to methods that 
identify modulators of protein tyrosine kinase functions. 
IT 186611-14-3D, complex with fibroblast growth factor receptor 1 
RL: BUU (Biological use, unclassified); PRP (Properties); BIOL 
(Biological 

study); USES (Uses) 

(crystal structures of a protein tyrosine kinase) 
186611-14-3 CAPLUS 

lH-Pyrrole-3-propanoic acid, 5- [ (1, 2-dihydro-2-oxo-3H-indol-3- 
ylidene ) methyl ] -4-met hvl- (9CI) (CA INDEX NAME) 
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(57) Abstract 

The present invention relates to the three- 
dimensional structures of a protein tyrosine kinase 
optionally completed with one or more compounds. 
The atomic coordinates that define the structures of 
the protein tyrosine kinase and any of the compounds 
bound to it are pertinent to methods for determining 
the three-dimensional structures of protein tyrosine 
kinases with unknown structure and to methods 
that identify modulators of protein tyrosine kinase 
functions. 
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DESCRIPTION 

CRYSTAL STRUCTURES OF A PROTEIN TYROSINE KINASE 

5 RELATED APPLICATIONS 

This application is related to U.S. Application 
Serial No. 08/701,191, by Mohammadi, et al., entitled 
"Crystals of the Tyrosine Kinase Domain of Non- Insulin 
Receptor Tyrosine Kinases," filed August 21, 1996 (Lyon 

10 Sc Lyon Docket No. 227/088) and U.S. Application Serial 
No. .60/034,168, by McMahon, et al . , entitled "Crystal 
Structures of a Protein Tyrosine Kinase Complexed with 
Compounds of the Oxindolinone/Thiolindolinone Family," 
filed December 19, 1996 (Lyon & Lyon Docket No. 

15 221/282) , which are hereby incorporated herein by 
reference in their entirety including any drawings, 
tables, and figures. 

INTRODUCTION 

20 The present invention relates to the three 

dimensional structures of protein kinases. 

BACKGROUND OF THE INVENTION 
The following description of the background of the 
25 invention is provided simply as an aid in understanding 
the invention and is not admitted to describe or 
constitute prior art to the invention. 

Protein tyrosine kinases (PTKs) comprise a large 
and diverse class of enzymes (for a review, a e 
30 Schlessinger and Ullrich, 1992, Neuron 9: 383-391). The 
PTK family contains multiple subfamilies, one of which 
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13 the fibroblast: growth factor receptor (FGF-R) 
subfamily (for a review, see Give! and Yayon, 1992, 
FASE3 J. 6 (15): 3362-3369). 

All PTKs enzymat ically transfer a high energy 

5 phosphate from adenosine triphosphate to a tyrosine 
residue in a target protein. These phosphorylation 
events regulate cellular phenomena in signal 
transduction processes. Cellular signal transduction 
processes contain multiple steps that convert an 

0 extracellular signal into an intracellular signal. The 
intracellular signal is then converted into a cellular 
response. PTKs are components in many signal 
transduction processes. A PTK regulates the flow of a 
signal in a particular step in the process by 

5 phosphorylating a downstream molecule. The addition of 
a phosphate can either modulate the activity of the 
downstream molecule by turning it "on" or "off". Thus, 
aberrations in a particular PTK's activity can either 
cause overflow or underflow of the signal. Overflow of 

0 a signal can lead to such abnormalities as uncontrolled 
cell proliferation, which is representative of such 
disorders as cancer and angiogenesis . 

Scientists in the biomedical community are 
searching for PTK inhibitors that down-regulate overflow 

5 signal transduction pathways. In particular, small 

molecule PTK inhibitors are sought that can traverse the 
cell membrane and not become hydrolyzed in acidic 
environments. These small molecule PTK inhibitors can 
be highly bioavailable and can be administered orally to 
0 patients. 

Some small molecule PTK inhibitors have already 
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been discovered. For example, bis (monocyclic) , bi cyclic 
or hecerocyclic aryl compounds (?CT WO 92/20642), 
vir.yiene-azair.dole derivatives !?CT WO 94/14308} , 1- 
cyciopropyl-4-pyridyl-quinolones (U.S. Pacenc No. 
5 5,330,992), scyryl compounds (U.S. Patent No. 

5,217,999), styryl- substituted pyridyl compounds (U.S. 
Patent No. 5,302,606), certain quinazoline derivatives 
(EP Application No. 0 566 266 Al) , seleoindoles and 
selenides (PCT WO 94/03427) , tricyclic polyhydroxylic 
10 compounds (PCT WO 92/21660) , and benzylphosphonic acid 
compounds (PCT WO 91/15495) are described as PTK 
inhibitors . 

Although many PTK inhibitors are known, many of 
these are not specific for PTK subfamilies and will 

15 therefore cause multiple side-effects as therapeutics. 
Compounds of the indolinone family, however, are 
specific for the FGFR subfamily and are non- 
hydrolyzable. WO 96/40116, n Indolinone Compounds for 
the Treatment of Disease," published December 19, 1996, 

20 inventors Tang et al. Although the use of X-ray 
crystallography has provided three dimensional 
structures of other PTKs, they are not complexed with 
PTK subfamily specific, hydrolysis resistant, small 
molecules . 

25 Despite recent advances, the need remains in the 

art for crystal lographic analysis of protein kinases, so 
that improved therapeutic molecules can be designed and 
synthesized. 

30 SUMMARY OP THE INVENTION 

The present invention relates to the three 
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dimensional structures of protein tyrosine kinases. The 
use of X-ray crystallography can define the three 
dimensional structure of protein tyrosine kinase at 
atomic resolution. 
5 The three dimensional structures described herein 

elucidate specific interactions between protein tyrosine 
kinases and compounds bound to them. The coordinates 
that define the three dimensional structures of protein 
tyrosine kinases are useful for determining three 

10 dimensional structures of PTKs with unknown structure. 
In addition, the coordinates are also useful for 
designing and identifying modulators of protein tyrosine 
kinase function. These modulators are potentially 
useful as therapeutics for diseases, including (but 

15 limited to) cell proliferative diseases, such as cancer, 
angiogenesis , atherosclerosis, and arthritis. 

Thus in a first aspect, the invention features a 
■crystalline form of a polypeptide corresponding to the 
catalytic domain of a protein tyrosine kinase. 

20 The term "crystalline form," in the context of the 

invention, is a crystal formed from an aqueous solution 
comprising a purified polypeptide corresponding to the 
catalytic domain of a PTK. A crystalline form of a 
protein tyrosine kinase is characterized as being 

25 capable of diffracting x-rays in a pattern defined by 
one of the crystal forms depicted in Blundel et al . , 
1976, Protein Crystal lography . Academic Press. A 
crystalline form of a protein kinase is not 
characterized as being capable of diffracting x-rays in 

3 0 a pattern analogous to a crystalline form consisting of 
primarily salt or primarily a compound, for example. 
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The term "protein tyrosine kinase," or ?7K , refers 
to an er.zyr.e that transfers the high energy phcsohate of 
adenosine triphosphate to a tyrosine residue located on 
a protein target . 
5 A protein tyrosine kinase catalytic domain of the 

invention can originate from receptor protein tyrosine 
kinases that bind fibroblast growth factor (FGF) . These 
protein tyrosine kinases are known as w FGFR" herein, and 
can relate to one member of the FGFR family, such as 
10 FGFR1. 

The term "catalytic domain" refers to the region of 
a protein that can exist as a separate entity from the 
protein. The catalytic domain of a protein tyrosine 
kinase is characterized as having considerable amino 

15 acid identity to the catalytic domain of other protein 
tyrosine kinases. Considerable amino acid identity 
preferably refers to at least 30% identity, more 
preferably at least 35% identity, and most preferably at 
least 4 0% identity. These degrees of amino acid 

20 identity refer to the identity between different protein 
tyrosine kinase families. Amino acid identity for 
members of a given protein tyrosine kinase family range 
from 55% to 90%. The catalytic domain may be functional 
as a separate entity. The catalytic domain of a protein 

25 tyrosine kinase is also characterized .as a polypeptide 
that is soluble in solution. 

The term "identity" identity as used herein refers 
to a property of sequences that measures their 
similarity or relationship. Identity is measured by 

3 0 dividing the number of identical residues in the two 
sequences by the total number of residues and 
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multiplying the produce by 10C. Thus, two copies of 
exactly the same sequence have 100% identity, but 
sequences that are less highly conserved and have 
deletions, additions, or replacements have a lower 
5 degree of identity. Those skilled in the art will 

recognize that several computer programs are available 
for determining sequence identity. 

The term "functional" refers to the ability of a 
catalytic domain to convert a substrate into a product 

10 by phosphorylating the substrate. The term "functional 
also relates to the ability of a catalytic domain to 
bind natural binding partners. The catalytic region 
may comprise an N-terminal tail, a catalytic core, and 
C-terminal tail. The catalytic core is a polypeptide 

15 that can be functional in terms of catalysis. N- and C 
terminal tails are polypeptide regions that may not 
confer appreciable functionality in terms of catalysis, 
but may confer functionality in terms of modulator 
specificity. 

2 0 A polypeptide can exist as a catalytic domain 

eventhough it is not functional. For example, a 
polypeptide corresponding to a catalytic domain may not 
be functional if it does not harbor phosphate moieties 
in key areas. Multiple examples of phosphorylation- 
25 state dependent function are well documented in the art 
Therefore, a catalytic domain can also exist without 
being functional. A measure of a protein kinase 
catalytic domain is a polypeptide that is homologous to 
other protein kinase catalytic domains. 

3 0 The term "polypeptide" refers to an amino acid 

chain representing a portion of, or the entire sequence 
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of, amino acids comprising a protein. 

A preferred embodiment of the invencicn includes a 

crystal line form of a ?TK chat is a receptor PTK . 

Receptors are proteins that straddle the inside and 
5 outside of the cell membrane. Receptor PTKs comprise an 

extracellular region, a transmembrane region, and an 

intracellular region comprising a catalytic domain. 

Another preferred embodiment of the invention is 

the crystalline form of a receptor PTK selected from the 
10 group consisting of FGF-R, PDGF-R, FLK, CCK4 , MET, TRKA, 

AXL , TIE, EPH, RYK, DDR, ROS, RET, LTK, ROR1 , and MUSK. 
Yet another preferred embodiment of the invention 

is the crystalline form of a PTK that is a non-receptor 

PTK . Non-receptor PTKs are located inside the cell and 
15 do not harbor extracellular or membrane -spanning 

polypeptides attached to the polypeptide corresponding 

to the catalytic domain. Non-receptor PTKs may harbor 

fatty acids or lipids, which can impart a membrane 

associated character to a PTK. In preferred embodiments 
20 of the invention, crystalline forms of non- receptor PTKs 

are selected from the group consisting of SRC, BRK, BTK, 

CSK, ABL, ZAP70 , FES, FAK, JAK, and ACK. 

In still another preferred embodiment, the 

invention features a crystalline form of a PTK that 
25 comprises a heavy metal atom. These types of crystals 

can be referred to as derivative crystals. 

The term "derivative crystal" refers to a crystal 

where the polypeptide is in association with one or more 

heavy-metal atoms. 
30 The term "association" refers to a condition of 

proximity between a chemical entity or compound, or 
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portions or fragments thereof, and tyrosine kinase 
domain protein, or portions or fragments thereof. The 
association may be non-covalent , i.e., where the 
juxtaposition is energetically favored by, e.g., 
5 hydrogen- bonding; van der Waals, electrostatic or 
hydrophobic interactions, or it may be covalent. 

The term "heavy metal atom" refers to an atom that 
is a transition element, a lanthanide metal, or an 
actinide metal. Lanthanide metals include elements with 
10 atomic numbers between 57 and 71, inclusive. Actinide 
metals include elements with atomic numbers between 89 
and 103, inclusive. 

In a preferred embodiment, the invention features a 
crystal of an FGF receptor tyrosine kinase domain 
15 protein. The FGF receptor tyrosine kinase domain 
protein can relate to FGFR1 . 

The term "FGFRl" refers to one member of multiple 
receptor PTKs that are homologous to one another and 
bind FGF. In this context, the term "homologous" refers 

2 0 to at least 70% amino acid identity between two members 

of the FGFR family. 

The term * FGFRl" can also refer to a mutant of 
human FGFRl which is characterized by the amino acid 
sequence of SEQ ID NO: 2. As compared to human FGFR1, 
25 FGFR1 contains the following amino acid substitutions: 
Cys-488 - Ala, Cys-584 - Ser, Leu-457 - Val, and has an 
additional five amino acid residues at the N-terminus 
(Ser-Ala-Ala-Gly-Thr) . 

The term "human FGFR1 " refers to the tyrosine 

3 0 kinase domain of human fibroblast growth factor receptor 

1 ( " FGFRl " ) having the amino acid sequence of SEQ ID . 
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NO: I. Generally, human FGFRI comprises a 310 amino acid 
residue fragment (residues 455 :o 765) cf human FGFRi . 

The cerm "mutant" refers :o a polypeptide which is 
obtained by replacing at least one amino acid residue in 
5 a nacive tyrosine kinase domain with a different amino 
acid residue. Mutation can be accomplished by adding 
and/or deleting amino acid residues within the native 
polypeptide or at the N- and/or C-terminus of a 
polypeptide corresponding to a native tyrosine kinase 
10 domain having substantially the same three-dimensional 
structure as the native tyrosine kinase domain from 
which it is derived. By having substantially the same 
three-dimensional structure is meant having a set of 
atomic structure coordinates .that have a root mean 
15 square deviation (r.m.s.d.) of less than or equal to 
about 2 A when superimposed with the atomic structure 
■coordinates of the native tyrosine kinase domain from 
which the. mutant is derived when at least about 5 0% to 
100% of the Ca atoms of the native tyrosine kinase are 
20 included in the superposition. A mutant may have, but 
need not have, PTK activity. 

In another preferred embodiment, the invention 
relates to a crystalline form defined by the structural 
coordinates set forth in Table 1. 
25 The term "atomic structural coordinates" as used 

herein refers to a data set that defines the three 
dimensional structure of a molecule or molecules. 
Structural coordinates can be slightly modified and 
still render nearly identical three dimensional 
3 0 . structures. A measure of a unique set of structural 
coordinates is the root -mean -square deviation of the 
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resulting structure. Structural coordinates that render 
three dimensional structures that deviate from one 
another by a root -mean- square deviation of less than 1.5 
A may be viewed by a person of ordinary skill in the art 
5 as identical. Hence, the structural coordinates set 

forth in Table 1, Table 2, Table 3, and Table 4 are net 
limited to the values defined therein. 

In other preferred embodiments, the invention 
features a crystalline form of the polypeptide in 

10 association with a compound. These types of crystalline 
forms can be referred to as co-crystals. The compound 
may be a cof actor, substrate, substrate analog, 
inhibitor, or allosteric effector. 

The term "compound" refers to an organic molecule. 

15 The term "organic molecule" refers to a molecule which 
has at least one carbon atom in its structure. The 
compound can have a molecular weight of less than 6kDa. 
Both the geometry of the compound and the interactions 
formed between the compound and the polypeptide 

2 0 preferably govern high affinity binding between the two 

molecules. High affinity binding is preferably governed 
by a dissociation equilibrium constant on the order of 
10** M or less. The compound is preferably a modulator 
that alters the function of a PTK. 
25 The term "function," in reference to the effect of 

a modulator on PTK function, refers to the ability of a 
modulator to enhance or inhibit the catalytic activity 
of a PTK. 

The term "catalytic activity" , in the context of 

3 0 the invention, defines the ability of a PTK to 

phosphorylate a substrate polypeptide. Catalytic 
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activity can be measured, for example, by decerT.::.;r.g 
the amount of a substrate converted to a product as a 
function of time. The conversion of the substrate to a 
product occurs at the active-site of the PTK. 
5 The term "active- site" refers to a cavity located 

in the ?TK in which one or more substrate molecules may 
bind. Addition of a modulator to cells expressing a ?TK 
may enhance (activate) or lower (inhibit) the catalytic 
activity of the PTK . 

10 A small number of inhibitors of PTK catalytic 

activity are known in the art. Small molecule 
inhibitors may modulate PTK function by blocking the 
binding of substrates. Indolinone compounds, for 
example, may bind to the active-site of PTK catalytic 

15 domains and inhibit them effectively, as measured by 
inhibition constants on the order of 10" 6 M or less. 

Activators of PTK intracellular regions can enhance 
PTK function by interacting with both the PTK catalytic 
domain and the substrate. Activators may also promote 

20 dimerization of PTKs and thus activate them by bringing 
them into close proximity with one another. In 
addition, activators may operate by promoting a 
conformational change in the intracellular region of the 
PTK such that the catalytic region modifies substrates 

25 at a faster rate in the presence of the activator. 

The term "function" can also refer to the ability 
of a modulator to enhance or inhibit the association 
between a PTK and a natural binding partner. 

The term "natural binding partner* refers to a 

30 polypeptide that normally binds to a PTK in a cell. 
These natural binding partners can play a role in 
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propagating a signal in a ?TK signal transduction 
process. The natural binding partner can bind to a ?TK 
with high affinity. High affinity represents an 
equilibrium binding constant on the order of 10*' M or 
5 less. However, a natural binding partner can also 

transiently interact with a PTK and chemically modify 
it. PTK natural binding partners are chosen from a 
group consisting of, but not limited to, src homology 2 
(SH2) or 3 (SH3) domains, other phosphoryl tyrosine 

10 binding ( ?TB) domains, nucleotide exchange factors, and 
other protein kinases or protein phosphatases. 

The term "interactions" refers to hydrophobic, 
aromatic, and ionic forces and hydrogen bonds formed 
between atoms in the modulator and the enzyme active - 

15 site. 

The term "cofactor" refers to a compound that may, 
in addition to the substrate, bind to a protein and 
undergo a chemical reaction. Multiple co- factors are 
nucleotides or nucleotide derivatives, such as phosphate 

20 and nicotinamide derivatives of adenosine. 

The term "substrate" refers to a compound that 
reacts with an enzyme. Enzymes can catalyze a specific 
reaction on a specific substrate. For example, PTKs can 
phosphorylate specific protein and peptide substrates on 

25 tyrosine moieties. In addition, nucleotides can act as 
substrates for protein kinases. 

The term "substrate analog" refers to a compound 
that is structurally similar, but not identical, to a 
substrate. The substrate analog may be a nucleotide 

30 analog. Examples of nucleotide analogs are described 
below. 
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The term " inhibitor" refers :o a compound that 
decreases the cellular function of a protein kinase. 
The protein kinase function is preferably the 
interaction with a natural binding partner and rr.ore 
5 preferably catalytic activity. 

The term "allosteric effector" refers to a compound 
that causes allosteric interactions in a protein. The 
term "allosteric interactions" refers to interactions 
between separate sites on a protein. The sites can be 
10 different from the active site. The allosteric effector 
can enhance or inhibit catalytic activity by binding to 
a site that may be different than the active site. 

The term "co-crystal" refers to a crystal where the 
polypeptide is in association with one or more 
15 compounds. 

In preferred embodiments, a co-crystal of the 
invention can be in association with a heavy metal atom. 
Examples of heavy metal atoms are described above . 
In other preferred embodiments, the invention 
20 features a co-crystal comprising the crystalline form of 
the polypeptide in association with a compound, where 
the compound is a non-hydrolyzable analog of ATP. These 
analogs can be referred to as nucleotide analogs. 

The term "ATP" refers to the chemical compound 
25 adenosine triphosphate. 

The term w non-hydrolyzable" refers to a compound 
having a covalent bond that does not readily react with 
water. Examples of non-hydrolyzable analogs of ATP are 
AMP-PNP and AMP-PCP, whose structures are well known to 
30 those skilled in the art. 

The term "AMP-PNP" refers to adenylyl 
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imidodiphosphate, a non-hydroiyzable analog of AT? . 

The term "AM? - PC?" refers to adenylyl 
dichosphonate, a r.on-hydrolyzabie analogue of AT?. 

In another preferred embodiment, the invention 
relates to a crystalline form defined by the structural 
coordinates set forth in Table 2. 

In preferred embodiments, the invention relates to 
crystalline forms, where the compound in association 
with the polypeptide is an indolinone. 

Certain indolinones are specific modulators of PTK 
function. A preferred embodiment of the invention is 
the crystalline form of a PTK complexed with an 
indolinone of formula I or II: 
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or a pharmaceutical^ acceptable salt, isomer, 

mecabolite, ester, amide, or prodrug thereof, where: 

(a) A lf A 2 , A 3 , and A< are independently carbon or 
5 nitrogen; 

(b) R x is hydrogen or alkyl; 

(c) R 2 is oxygen in the case of an oxindolinone or 
sulfur in the case of a thiolindolinone; 

(d) R 3 is hydrogen; 

10 (e) R 4 , R 5 , R 6 , and R 7 are optionally present, and 

are either (i) independently selected from the group 
consisting of alkyl, alkoxy, aryl, aryloxy, alkaryl, 
alkaryloxy, halogen, trihalomethyl, S(0)R, S0 2 NRR* , S0 3 R, 
SR, .N0 2 , NRR* , OH, CN, C(0)R, 0C(0)R, NHC(0)R, (CH 2 ) n C0 2 R, 

15 and CONRR' or (ii) any two adjacent R«, R 5 , R 6 , and R 7 
taken together form a fused ring with the aryl portion 
of the indole-based portion of the indolinone; 

(f) R 2 ', R 3 ', R 4 ' , R 5 ', and R 6 ' are each 
independently selected from the group consisting of 

20 hydrogen, alkyl, alkoxy, aryl, aryloxy, alkaryl, 

alkaryloxy, halogen, trihalomethyl , S(0)R, S0 2 NRR' , S0 3 R, 
SR, N0 2 , NRR 1 , OH, CN, C(0)R, 0C(0)R, NHC{0)R, <CH 2 ) n C0 2 R, 
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and CONRR ' ; 

(g) n is 0 , 1, 2, or 3 ; 

(h) R is hydrogen, aikyl cr aryl; 

(i) R' is hydrogen, aikyl or aryl; and 

5 ( j ) A is a five membered heceroaryl ring selected 

from the group consisting of thiophene, pyrrole, 
pyrazole, imidazole, 1,2, 3-triazole, 1, 2 , 4 -triazoie , 
oxazole, isoxazole, thiazole, isothiazole, furan, 1,2,3- 
oxadiazole , 1,2, 4 -oxadiazole , 1,2, 5-oxadiazole , 1,3,4- 

0 oxadiazole, 1,2,3, 4 -oxatriazole , 1,2,3, 5-oxatriazole , 

1.2. 3- thiadiazole, 1 , 2 , 4 - thiadiazole , 1,2, 5 - thiadiazole, 

1. 3. 4 - thiadiazole, 1,2, 3 , 4 - thiatriazole , 1,2,3,5- 
thiatriazole , and tetrazole, optionally substituted at 
one or more positions with aikyl, alkoxy, aryl, aryloxy, 

5 aikaryl,. alkaryloxy, halogen, trihalomethyl , 3(0)R, 
"S0 2 NRR 1 , S0 3 R, SR, N0 2 , NRR 1 , OH, CN, C(0)R, OC(0)R, 
NHC(0)R, (CH 2 ) n C0 2 R or CONRR ' . 

The term "pharmaceutical^ acceptable salt" refers 
to those salts which retain the biological activity and 
0 properties of the free bases. Pharmaceutical^ 

acceptable salts can be obtained by reaction with 
inorganic acids such as hydrochloric acid, hydrobromic 
acid, sulfuric acid, nitric acid, phosphoric acid, 
methanesulfonic acid, ethanesulf onic acid, p- 
5 toluenesulfonic acid, salicylic acid and the like. 

The term "prodrug" refers to an agent that is 
converted into the parent drug in vivo. Prodrugs may be 
easier to administer than the parent drug in some 
situations. For example, the prodrug may be 
0 bioavailable by oral administration but the parent is 

not, or the prodrug may improve solubility to allow for 
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intravenous administration . 

"Alkyl " refers :o a straight -chain, branched or 
cyclic saturated aliphatic hydrocarbon. Preferably, the 
alkyl group has 1 to 12 carbons. More preferably, it is 

5 a lower alkyl of from 1 to 7 carbons, T.ore preferably 1 
to 4 carbons. Typical alkyl groups include methyl, 
ethyl, propyl, isopropyl, butyl, isobutyi, tertiary 
butyl, pencyl, hexyl and the like. The alkyl group may 
be optionally substituted with one or more substituents 

0. are selected from the group consisting of hydroxyl , 

cyano, aikoxy, =0, = S, N0 2 , halogen, N(CH 3 ) 2 amino, and 
SH. 

"Alkenyi" refers to a straight -chain, branched or 
cyclic unsaturated hydrocarbon group containing at least 

5 one carbon-carbon double bond. Preferably, the alkenyi 
group has 2 to 12 carbons. More preferably it is a 
lower alkenyi of from 2 to 7 carbons, more preferably 2 
to 4 carbons. The alkenyi group may be optionally 
substituted with one or more substituents selected from 

0 the group consisting of hydroxyl, cyano, aikoxy, =0, =S, 
N0 a , halogen, N(CH 3 ) 2 amino, and SH. 

"Alkynyl" refers to a straight -chain, branched or 
cyclic unsaturated hydrocarbon containing at least one 
carbon-carbon triple bond. Preferably, the alkynyl 

5 group has 2 to 12 carbons. More preferably it is a lower 
alkynyl of from 2 to 7 carbons, more preferably 2 to 4 
carbons. The alkynyl group may be optionally 
substituted with one or more substituents selected from 
the group consisting of hydroxyl, cyano, aikoxy, =0, =S, 

0 N0 2/ halogen, N(CH 3 ) 2 amino, and SH. 

"Aikoxy" refers to an "O-alkyl" group. 
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"Aryl" refers :o an aroma- ic group which has at 
least one ring having a conjugated pi-electron system 
and includes carbocyclic ary 1 , heterocyclic aryl and 
biaryl groups. The aryl group may be optionally 
5 substituted with one or more substicuents selected from 
the group consisting of halogen, trihalomethyl , 
hydroxy 1, SH, OH, N0 2 , amine, thioether, cyano, aikoxy, 
alkyl, and amino. 

"Alkaryl" refers to an alkyl that is covalently 
10 joined to an aryl group. Preferably, the alkyl is a 
lower alkyl. 

"Carbocyclic aryl" refers to an aryl group wherein 
the ring atoms are carbon. 

"Heterocyclic aryl" refers to an aryl group having 
15 from 1 to 3 heteroatoms as ring atoms, the remainder of 
the ring atoms being carbon. Heteroatoms include 
oxygen, sulfur, and nitrogen. Thus, heterocyclic aryl 
groups include furanyl, thienyl, pyridyl, pyrrolyl, N- 
lower alkyl pyrrolo, pyrimidyl, pyrazinyl, imidazolyl 
2 0 and the like. 

"Amide" refers to -C(0)-NH-R, where R is alkyl, 
aryl, alkylaryl or hydrogen. 

"Thioamide" refers to -C(S)-NH-R, where R is alkyl, 
aryl, alkylaryl or hydrogen. 
25 "Amine" refers to a -N(R , )R 11 group, where R' and 

R 1 1 are independently selected from the group consisting 
of alkyl, aryl, and alkylaryl. 

"Thioether" refers to -S-R, where R is alkyl, aryl, 
or alkylaryl. 

30 "Sulfonyl" refers to -S(0) 2 -R, where R is aryl, 

C(CN)«C-aryl, CH 2 CN, alkyaryl, sulfonamide, NH -alkyl, NH- 
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aikylaryl, cr NK-aryl. 

The term "acyl" denotes groups -C(OJR, where R is 
a iky I as defined above, such as formyi, acet vi , 
prcpionyi, or bucyryl . 

I- is understood by chose skilled in the art -hat 
when A,, A 2 , A J# and A, are nitrogen or sulfur that the 
corresponding R 4 , R 5 , R 6 , and R 7 , as well as the 
corresponding bond, do not exist. 

Examples of indoles having such fused rings (as 
described in (e) (ii) above include the following: 




The six membered rings shown above exemplify 
possible A rings in compound II. 
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Other preferred embodiments or. the invent ion are 
crystalline forms comprising 3 - [ ( 3 - ( 2 -carbcxyethyl : - 4 - 
methylpyrrol - 5 -yl ) methylene] - 2 - indclinone as well as 3- 
[4 - (4 - f ormyipiperazme- 1 - yl - ) benzyl idenyl] - 2 - mdoli none . 
5 The polypeptide of these crystalline forms can be FGFR , 
and specifically, FGFR1 . 

In preferred embodiments; the crystalline forms of 
the invention can be defined by the structural 
coordinates set forth in Table 3 or Table 4. 

10 The ase of X-ray crystallography can elucidate the 

three i , r.ensional structure of crystalline forms of the 
invention. The first characterization of crystalline 
forms by X-ray crystallography can determine the unit 
cell shape and its orientation in the crystal. 

15 in other preferred embodiments , the invention 

features a crystal of an FGF receptor tyrosine kinase 
domain protein, where the crystal is characterized by 
having monoclinic unit cells. The crystal may also be 
characterized by having space group symmetry C2 . 

20 The term "unit cell" refers to the smallest and 

simplest volume element (i.e., parallelpiped- shaped 
block) of a crystal that is completely representative of 
the unit of pattern of the crystal. The dimensions of 
the unit cell are defined by six numbers: dimensions a, 

2 5 b and c and angles cr, 3 and Y- A crystal can be viewed 
as an efficiently packed array of multiple unit cells. 
Detailed descriptions of crystallographic terms are 
described in, which is hereby incorporated herein by 
reference in its entirety, including any drawings, 
30 figures, and tables. 

The term "monoclinic unit cell" refers to a unit 
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cell where a ? b * c; a = v = 90 3 ; and 3 > 9 0° . 

The term "space group" refers to the symmetry of a 
ur.it ceil. In a space group designation (e.g., C2) the 
capital letter indicates the lattice type and the other 
5 symbols represent symmetry operations that can be 
carried out on the unit cell without changing its 
appearance. 

The term "lattice" in reference to crystal 
structures refers to the array of points defined by the 
10 vertices of packed unit cells. 

The term "symmetry operations" refers to 
geometrically defined ways of exchanging equivalent 
parts o£ a unit cell, or exchanging equivalent molecules 
between two different unit cells. Examples of symmetry 
15 operations are screw axes, centers of inversion, and 
mirror planes. 

In a preferred embodiment, the invention features* a 
crystalline form, where the monoclinic unit cells have 
dimensions of about a-208.3 A, b=57.8 A, c=65.5 A and 
20 3=107.2°. 

In a preferred embodiment, the invention features a 
FGFR1 crystal, where the monoclinic unit cells have 
dimensions of about a-211.6 A, b«51.3 A, c«66.1 A and 
3=107,7°. 

25 In another aspect the invention features a 

polypeptide corresponding to the catalytic domain of a 
protein tyrosine kinase, containing at least about 20 
amino acid residues upstream of the first glycine in the 
conserved glycine-rich region of the catalytic domain, 

30 and at least about 17 amino acid residues downstream of 
the conserved arginine located at the C-terminal 
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boundary cf the catalytic dona in. 

The polypeptides of the invention can be isolated, 
enriched or purified. In addition, the crystalline 
forms of the invention can be formed frorr. polypeptides. 
5 that are isolated, enriched, or purified. 

By "isolated" in reference to a polypeptide is 
meant a polymer of 6, 12, 13 or more amino acids 
conjugated to each other, including polypeptides that 
are isolated from a natural source or that are 
10 synthesized. The isolated polypeptides of the present 
invention are unique in the sense that they are not 
found in a pure or separated state in nature. Use of 
the term "isolated" indicates that a naturally occurring 
sequence has been removed from its normal cellular 
15 environment. Thus, the sequence may be in a cell -free 
solution or placed in a different cellular environment. 
The term does not imply that the sequence is the only 
amino acid chain present, but that it is essentially 
free (about 90 - 95% pure at least) of material 
20 naturally associated with it. 

By the use of the term "enriched" in reference to a 
polypeptide it is meant that the specific amino acid 
sequence constitutes a significantly higher fraction (2 
- 5 fold) of the total of amino acids present in the 
25 cells or solution of interest than in normal or diseased 
cells or in the cells from which the sequence was taken. 
This could be caused by a person by preferential 
reduction in the amount of other amino acids present, or 
by a preferential increase in the amount of the specific 
3 0 . amino acid sequence of interest, or by a combination of 
the two. However, it should be noted that "enriched" 
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does not imply that there are no other amino acid 
sequences present, lust that the relative amount of the 
sequence of interest has been significantly increased. 
The term significant here is used to indicate that the 
5 level of increase is useful to the person making such an 
increase, and generally means an increase relative to 
other amino acids of about at least 2 fold, more 
preferably at least 5 to 10 fold or even more. The term 
also does not imply that there are no amino acids from 

10 other sources. The other source amino acids may, for 
example, comprise amino acids encoded 'by a yeast or 
bacterial genome, or a cloning vector such as pUC19. 
The term is meant to cover only those situations in 
which a person has intervened to elevate the proportion 

15 of the desired nucleic acid. 

It is also advantageous for some purposes that an 
amino acid sequence be in purified form. The term 
"purified" in reference to a polypeptide does not 
require absolute purity (such as a homogeneous 

20 preparation) ; instead, it represents an indication that 
the sequence is relatively purer than in the natural 
environment (compared to the natural level this level 
should be at least 2-5 fold greater, e.g., in terms of 
mg/ml) . Purification of at least one order of 

25 magnitude, preferably two or three orders, and more 

preferably four or five orders of magnitude is expressly 
contemplated. The substance is preferably free of 
contamination at a functionally significant level, for 
example 90%, 95%, or 99% pure. 

30 In a preferred embodiment, the invention features a 

polypeptide corresponding to the catalytic domain of a 
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receptor ?TK. The recepcor PTK may have a three- 
dimensional structure substantially similar to that of 
the insulin receptor, even though the amino acid content 
may be different. 
5 In a preferred embodiment, the invention features a 

polypeptide corresponding to the catalytic domain of a 
non-receptor PTK, where the non- insulin receptor 
tyrosine kinase is a cytoplasmic tyrosine kinase. 

In a preferred embodiment, the invention features a 

10 polypeptide corresponding to the catalytic domain of a 

receptor PTK, selected from the group consisting of FGF- 
R, PDGF-R, KDR, CCK4 , MET, TRKA, AXL, TIE, EPH, RYK, 
DDR, ROS, RET, LTK, ROR1, or MUSK. 

In a preferred embodiment, the invention features a 

15 polypeptide corresponding to the catalytic domain of a 
non- receptor PTK, selected from the group consisting of 
SRC, BRK, BTK, CSK, ABL, ZAP70, FES, FAK, JAK, or ACK. 

In a preferred embodiment, the invention features a 
polypeptide corresponding to the catalytic domain of a 

2 0 PTK, having the amino acid sequence shown in Table 1 or 
Table 2. 

In another aspect, the invention features a method 
for creating crystalline forms described herein. The 
method may utilize the polypeptides described herein to 

2 5 form a crystal. The method comprises the steps of: 

(a) mixing a volume of polypeptide solution 
with a reservoir solution; and 

(b) incubating the mixture obtained in step 
(a) over the reservoir solution in a closed container, 

3 0 under conditions suitable for crystallization. 

These processes are described in detail in the 
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section entitled "Detailed Description of the 
Invention . " 

In another aspect, the invention features a rr.ethcd 
of obtaining FG? receptor tyrosine kinase domain 
5 polypeptide in crystalline form, comprising the steps 

of: (a) mixing a volume of polypeptide solution with an 
equal volume of reservoir solution, where the 
polypeptide solution comprises 1 mg/mL to 60 mg/mL FGF- 
type tyrosine kinase domain protein, 10 mM to 200 mM 

10 buffering agent, 0 mM to 20 mM dithiothreitol and has a 
pH of about 5.5 to about 7.5, and where the reservoir 
solution comprises 10% to 30% (w/v) polyethylene glycol, 
0.1 M to 0.5 M ammonium sulfate, 0% to 20% (w/v) 
ethylene glycol or glycerol, 10 mM to 200 mM buffering 

15 agent and has a pH of about 5.5 to about 7.5; and .(b) 
. incubating the mixture obtained in step (a) over said 
reservoir solution in a closed container at a 
temperature between 0° and 25°C until crystals form. 

In a preferred embodiment, the invention features a 

2 0 method of obtaining FGF receptor tyrosine kinase domain 
polypeptide in crystalline form, where the polypeptide 
solution comprises about 10 mg/mL FGF receptor tyrosine 
kinase domain, about 10 mM sodium chloride, about 2 mM 
dithiothreitol, about 10 mM Tris-HCl and has a pH of 

25 about 8; the reservoir buffer comprises about 16% (w/v) 
polyethylene glycol (MW 10000) , about 0.3 M ammonium 
sulfate, about 5% ethylene glycol or glycerol, about 100 
mM bis-Tris and has a pH of about 6.5; and the 
temperature is about 4°C. 

30 in another preferred embodiment, the invention 

features a method of obtaining FGF receptor tyrosine 
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kinase domain polypept ide m crystalline form, where the 
polypeptide solution includes a compound such as a 
cefaclor, substrate, substrate analog, inhibitor or 
al lost eric effector. 
5 In still another preferred embodiment, the 

invention features a method of obtaining FGF receptor 
tyrosine kinase domain polypeptide in crystalline form, 
where the compound is a nucleotide analog, such as a 
non-hydrolyzable analog of ATP, or an indolinone. 

10 Indolinone compounds have the general structural formula 
as described herein. 

■ In another aspect, the invention features a cDNA 
encoding an FGF receptor tyrosine kinase domain protein, 
where a coding strand of the cDNA has the nucleotide 

15 sequence of SEQ ID NO: 5. 

Another aspect of the invention relates to a method 
of determining three dimensional structures of PTKs with 
unknown structure by utilizing the structural 
coordinates of Table l, Table 2, Table 3, and Table 4. 

2 0 These methods can relate to homology modeling, molecular 

replacement, and nuclear magnetic resonance methods. 

In a preferred embodiment, the invention relates to 
a method of determining three dimensional structures of 
PTKs with unknown structures by utilizing the 
25 coordinates of Table 1, Table 2, Table 3, or Table 4 in 
conjunction with the amino acid sequences of PTKs . This 
method of homology modeling comprises the steps of: (a) 
aligning the computer representation of an amino acid 
sequence of a PTK with unknown structure with that of a 

3 0 PTK with known structure, where alignment is achieved by 

matching homologous regions of the amino acid sequences ; 



WO 98/07835 



PCT/US97/14885 



27 

Co* transferring the computer representation of a., amino 
acid structure in the ?7X sequence of known structure to 
v. a computer representation of a structure of the 
corresponding amino acid in the ?TK sequence with 
unknown structure; and (c) determining low energy 

conformations of the resulting PTK structure. 

The term "amino acid sequence" describes the order 

of amino acids in the amino acid chain comprising a 

polypeptide corresponding to the catalytic domain of a 

PTK . 

The term "aligning" describes matching the 
. beginning and the end of two or more amino acid 

sequences. Homologous amino acid sequences are placed '' 
on top of one another during the alignment process. 

The term "homologous" describes amino acids in two 
sequences that are identical or have similar side-chain 
chemical groups (e.g., aliphatic, aromatic, polar, 
negatively charged, or positively charged) . 

The term "corresponding" refers to an amino acid 
that is aligned with another in the sequence alignment 
mentioned above. 

The term "determining the low energy conformation" 
describes a process of changing the conformation of the 
PTK structure such that the structure is of low free 
energy. The PTK structure may or may not have molecules, 
such as modulators bound to it. 

The term "low free energy" describes a state where 
the molecules are in a stable state as measured by the 
process. A stable state is achieved when favorable 
interactions are formed within the complex. 

The term "favorable interactions" refers to 
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hydrophobic, aromatic, ar.d. ionic forces, and hydrogen 
bonds . 

Another preferred embodiment of the invention 
relates to a method of determining three dimensional 
structures of PTKs with unknown structure. This method 
is accomplished by applying the structural coordinates 
of Table 1, Table 2, Table 3, or Table 4 to an 
incomplete X-ray crystallographic data set for a PTK. 
The method comprises the steps of: (a) aligning the 
) positions of atoms in the unit ceil by matching electron 
diffraction data from two crystals, where one data set 
is complete and the other is incomplete; and (b) 
determining a low energy conformation of the resulting 
PTK structure. 

5 The term "incomplete data set" relates to a X-ray 

crystallographic data set that does not have enough 
information to give rise to a thre* dimensional 
structure . 

In another preferred embodiment, the invention 
0 relates to a method of determining three dimensional 
structures of PTKs with unknown structure by applying 
the structural coordinates of Table 1, Table 2, Table 3 
or Table 4 to nuclear magnetic resonance (NMR) data cf 
PTK. This method comprises the steps of: (a) 
25 determining the secondary structure of a PTK structure 
using NMR data; and (b) simplifying the assignment of 
through-space interactions of amino acids. The PTK 
structure may not be complexed with compounds or 
modulators . 

30 The term "secondary structure" describes the 

arrangement of amino acids in a three dimensional 
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structure, such as in a-helix cr 3-sheet elements. 

The term n through- space interactions" defines che 
orientation of the secondary structural elements in che 
three dimensional structure and the distances between 
5 amino acids from different portions of the amino acid 
sequence . 

The term "assignment" defines a method of analyzing 
NMR data and identifying which amino acids give. rise to 
signals in the NMR spectrum. 

10 In another aspect, the invention features a method 

of identifying potential modulators of PTK function. 
These modulators are identified by docking a computer 
representation of a structure of a compound with a 
computer representation of a cavity formed by the 

15. active-site of a PTK. The computer representation of 
the PTK active-site structure can be defined by 
structural coordinates. 

The term "chemical group" refers to moieties that 
can form hydrogen bonds, hydrophobic, aromatic, or ionic 

2 0 interactions. 

The term Mocking" refers to a process of placing a 
compound in close proximity with a PTK. The term can 
also refer to a process of finding low energy 
conformations of the compound/PTK complex. 

25 A preferred embodiment of the invention is a method 

of identifying potential modulators of PTK function. 
The method involves utilizing the structural coordinates 
or a PTK three dimensional structure. The structural 
coordinates set forth in Table 1, Table 2, Table 3, and 

30 Table 4 can be utilized. The method comprises the steps 
of: (a) removing a computer representation of a PTK 
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structure and docking a computer representation of a 
compound from a computer data base with a computer 
representation of the active-site of the ?TK; \b) 
determining a conformation of the complex with a 
5 favorable geometric fit and favorable complementary 

interactions; and (c) identifying compounds that best 
fit the PTK active-site as potential modulators of PTK 
function. The initial PTK structure may or may not have 
compounds bound to it . 

10 The term "favorable geometric fit" refers to a 

conformation of the compound- PTK complex where the 
surface area of the compound is in close proximity with 
the surface area of the active-site without forming 
unfavorable interactions. Unfavorable interactions can 

15 be steric hindrances between atoms in the compound and 
atoms in the PTK active-site. 

The term "favorable complementary interactions" 
relates to hydrophobic, aromatic, ionic, and hydrogen 
bond donating, and hydrogen bond accepting forces formed 

2 0 between the compound and the PTK active-site. 

The term "potential" qualifies the term "modulator 
of PTK function" because the potential modulator or PTK 
function has not yet been tested for activity in vitro 
or in vivo. 

25 The term w best fit* describes compounds that 

complexed the most surface area in the complex and/or 
form the most favorable complementary interactions with 
the PTK in the screen in a given experiment. 

Another preferred embodiment of the invention is a 

3 0 method of identifying potential modulators of PTK 

function. The method involves utilizing x a three 
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dimensional structure of a PTK, with or without 
compounds bound co it . The method comorises t he steos 
of: .a) modifying a computer representation of a PTK 
having one or more compounds bound to it, where the 
5 computer representations of the compound or compounds 
and PTK are defined by structural coordinates; (b) 
determining a conformation of the complex with a 
favorable geometric fit and favorable complementary 
interactions; and (c) identifying the compounds that 
10 best fit the PTK active-site as potential modulators of 
PTK function. 

The term "modifying" relates to deleting a chemical 
group or groups or adding a chemical group or groups. 
Computer representations of the chemical groups can be 

15 selected from a computer data base. 

Yet another preferred embodiment of the invention 
is a method of identifying potential modulators of PTK 
function by operating modulator construction or 
modulator searching computer programs on the compounds 

2 0 complexed with the PTK. The method comprises the steps 
of: (a) removing a computer representation of one or 
more compounds complexed with a PTK; and (b) searching a 
data base for compounds similar to the removed compounds 
using a compound searching computer program, or 

25 replacing portions of the compounds complexed with the 
PTK with similar chemical structures from a data base 
using a compound construction computer program, where 
the representations of the compounds are defined by 
structural coordinates. 

30 The term "operating" as used herein refers to 

utilizing the three-dimensional conformation of 
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molecules defined by the processes described herein in 
various computer programs. 

The lerm "similar compound" refers to a compound m 
a computer data base that has a similar geometric 
structure as compounds that can bind to a PTK. The 
similar compound can also have similar chemical groups 
as the compounds that are either bound to the PTK or 
once bound to the PTK. The similar chemical groups can 
form complementary interactions with the PTK. 

Xhe term "compound searching computer program*" 
describes a computer program that searches computer 
representations of compounds from a computer data base 
that have similar three dimensional structures and 
similar chemical groups as a compound of interest. The 
compound of interest is preferably an indolinone 
compound . 

The term "similar chemical structures" refers to 
chemical groups that share similar geometry as portions 
of the compounds in complex with the PTK or compounds 
removed from the PTK structure. Similar chemical 
structures can also refer to chemical groups that may 
form similar complementary interactions as portions of 
the compounds in complex with the PTK or compounds 
removed from the PTK structure. 

The term "replacing structures* refers to removing 
a portion of the compounds in complex with the PTK or 
compounds removed from the PTK structure and connecting 
the broken bonds to a similar chemical structure. 

The term "compound construction computer program" 
describes a computer program that replaces computer 
representations of chemical groups in a compound with 
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groups from a computer data base. The compound is 
preferably an indolinor.e compound. 

The :erm "similar three dimensional scruc:ure" 
describes cwo molecules with nearly identical shape and 
5 volume. 

In another preferred embodiment of the invention, 
the ?TK structures used in the modulator design or 
identification method of the invention are defined by 
the structural coordinates of Table 1, Table 2, Table 3, 

10 or Table 4 . 

The methods for using the crystalline forms and 
three dimensional structures of the invention can relate 
to a broad range of protein kinases. Thus, in preferred 
embodiments, the invention relates to a receptor PTK. 

15 The receptor PTK can be selected form the group 

consisting of FGF-R, PDGF-R, FLK, CCK4 , MET , TRKA, AXL, 
TIE, EPH, RYK, DDR, ROS, RET, LTK, ROR1 , and MUSK. The 
PTK may also exist as a non-receptor PTK. The non- 
receptor PTK can be selected from the group consisting 

20 of SRC, BRK, BTK, CSK, ABL, ZAP70 , FES, FAK, JAK, and 
ACK. 

In another aspect, the invention features a 
potential modulator of PTK function identified by 
methods disclosed in the invention. 

25 A preferred embodiment of the invention is that the 

potential modulator of PTK function is an oxindolinone 
or a thiolindolinone of formula I or II. disclosed above. 

Another aspect of the invention is a method for 
synthesizing a potential modulator of PTK function or 

30 its pharmaceutical^ acceptable salts, isomers, 

metabolites, esters, amides, or prodrugs by a standard 
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synthetic method known in che ar:. Synthetic procedures 
are discussed below. 

In another aspect, the invention features a method 
of identifying a potential modulator of ?TK function as 
5 a modulator of PTK function. The method comprises the 

steps of: (a) administering a potential modulator of PTK 
function to cells; (b) comparing the level of PTK 
phosphorylation between cells not administered the 
potential modulator and cells administered the potential 
10 modulator; and (c) identifying the potential modulator 
as a modulator of PTK function based on the difference 
, in the level of PTK phosphorylation. 

The term "cells" refers to any type of cells either 
primary or cultured. Primary cells can be extracted 
15 directly from an organism while cultured cells rapidly 
divide and can be cultured in many successive rounds. 
Cells can be grown in a variety of containers including, 
but not limited to flasks, dishes, and well plates. 
The term "administer" refers to a method of 
20 delivering a compound to cells. The compound can be 
prepared using a carrier such as dimethyl sulfoxide 
(DMSO) in an aqueous solution. The aqueous solution 
comprising the compound, also termed an "aqueous 
preparation" , can be simply mixed into the medium 

2 5 bathing the layer of cells or microinjected into the 

cells themselves. The compounds may be administered to 
the cells using a suitable buffered solution. 

The term "suitable buffered solution" refers to an 
aqueous preparation of the compound that comprises a 

3 0 salt that can control the pH of the solution at low 

concentrations. Because the salt exists at low 
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concentrations, the sal* preferably does not alter the 
function of the cells. 

The term " ?TK phosphorylation" refers to the 
presence of phosphate on the PTK. Phosphates on ?TKs 
5 can be identified by antibodies that bind them 
specifically with high affinity. 

In another aspect, the invention features a method 
of identifying a potential modulator of PTK function as 
a modulator of PTK function. The method comprises the 
10 steps of: (a) administering a potential modulator of PTK 
function to cells; (b) comparing the level of cell 
growth between cells not administered the potential 
modulator and cells administered the potential 
modulator; and (c) identifying the potential modulator 
15 as a modulator of PTK function based on the difference 
■ in cell growth. 

The term u cell growth" refers to the rate at which 
a group of cells divides. Cell division rates can be 
readily measured by methods utilized by those skilled in 
2 0 the art. 

Another aspect of the invention features a method 
of diagnosing a disease by identifying cells harboring a 
PTK with inappropriate activity. The method comprises 
the steps of: (a) administering a modulator of PTK 
25 function to cells; (b) comparing the rate of cell growth 
between cells not administered the modulator and cells 
administered the modulator; and (c) diagnosing a disease 
by characterizing cells harboring a PTK with 
inappropriate activity from th effect of the modulator 
30 on the difference in the rate of cell growth. The 
modulator can be identified by the methods of the 
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invention . 

The term "inappropriate activity" refers to a ?TK 
that regulates a step in a signal transduction process 
at a higher or lower rate than normal cells. 
5 Aberrations in the rate of signal transduction can oe 
caused by alterations in the stimulation of a receptor 
?TK by a growth factor, alterations in the activity of 
?TK-specific phosphatase, over-expression of a PTK in a 
cell, or mutations in the catalytic region of the PTK 

10 itself. 

The term "signal transduction process" describes 
the. steps in a cascade of events where an extracellular 
signal is transmitted into an intracellular signal. 

The term "PTK-specif ic phosphatase" describes an 

15 enzyme that dephosphorylates a particular PTK and 
thereby regulates that PTK's activity. 

Another aspect of the invention is a method of 
treating a disease associated with a PTK with 
inappropriate activity in a cellular organism, where the 

20 method comprises the steps of: (a) administering the 
modulator of PTK function to the organism, where the 
modulator is in an acceptable pharmaceutical 
preparation; and (b) activating or inhibiting the PTK 
function to treat the disease. 

25 The term "organism" relates to any living being 

comprised of at least one cell. An organism can be as 
simple as one eukaryotic cell or as complex as a mammal. 

The term "administering" , in reference to an 
organism, refers to a method of introducing the compound 

3 0 to the organism. The compound can be administered when 
the cells or tissues of the organism exist within the 
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organism or ourside of the organism. Cells existing 
outside the organism can be maintained or grown in cell 
cul-ure dishes. For ceils harbored within the organism, 
many techniques exist in the art to administer 
5 compounds, including (but not limited to) oral, 

parenteral, dermal, and injection applications. For 
cells outside of the patient, multiple techniques exist 
in the art to administer the compounds, including (but 
not limited to) cell microinjection techniques, 

10 transformation techniques, and carrier techniques. 

The term "pharmaceut ically acceptable composition" 
refers to a preparation comprising the modulator of PTK 
activity. The composition is acceptable if it does not 
appreciably cause irritations to the organism 

15 administered the compound. 

Preferred embodiments of the of the invention are 
that the PTK is a receptor PTK selected from the group 
consisting of FGF-R, PDGF-R, FLK-1, CCK4 , MET, TRKA, 
AXL, TIE, EPH, RYK, DDR, ROS, RET, LTK, ROR1, and MUSK. 

20 Other preferred embodiments of the invention are that 
the PTK is a non- receptor PTK selected from the group 
consisting of SRC, BRK, BTK, CSK, ABL, ZAP70, FES, FAK, 
JAK, and ACK. 

The summary of the invention described above is 

25 non- limiting and other features and advantages of the 
invention will be apparent from the following detailed 
description, and from the claims. 



30 



BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 provides a ribbon diagram of the structure 
of FGFR1 showing the side chains of tyrosines Tyr-653 
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and Tyr - 6 54 and the ct helical (aC, aD , ztZ , c*ZF , aF-al), 
3 strand (BI-B5, 37, 38) , nucleotide -binding loop, 
catalytic loop, activation loop and kinase insert 
regions of the molecule. The termini are denoted by N 
5 and C. The loop between B2 and £3 is disordered, 
indicated by a break in the chain in this region. 

FIG. 2 provides a stereo view of a C a trace of FGFRI 
shown in the same orientation as FIG. 1, with every 
tenth amino acid residue marked with a filled circle and 

10 every twentieth amino acid residue labeled with a 
residue number . 

FIG. 3 provides a structure-based sequence 
alignment of human fibroblast growth factor receptor 1 
(FGFRI), human fibroblast growth factor receptor 2 

15 ( FGFR2 ) , human fibroblast growth factor receptor 3 
( FGFR3 ) , human fibroblast growth factor receptor 4 
( FGFR4 ) , a D. malanogaater homolog (DFGFR1) , a C. 
elegans homolog (EGL-1S) and insulin receptor tyrosine 
kinase (IRK) . 

20 FIGS. 4A and 4B provide ribbon diagrams of the 

N- terminal lobes (4A) and C- terminal lobes (4B) of FGFRI 
and IRK in which the C a atoms of the 3 sheets (4A) or ot- 
helices (4B) of the two proteins have been superimposed. 
FIG. 5 illustrates the side-chain positions of the 

25 tyrosine autophosphorylation sites of FGFRI on the 
backbone representation of FGFRI. 

FIGS. 6A and 6B are amino acid sequence alignments 
of the catalytic domains of PTKs, including receptor and 
non-receptor type PTKs. FIG. 6A depicts one 

3 0 representative member from each of the eighteen 

subfamilies of receptor tyrosine kinases. FIG. 6B 
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depicts one representative member from each of che 
subfamilies of cytoplasmic tyrosine kinases. In FIGS. 
6A and 63 highly conserved residues are boxed. The 
position of the glycine-rich domain, kinase insert, 
catalytic loop, and activation loop are indicated. The 
numbering is for human FGF-receptor . 

BRIEF DESCRIPTION OF THE CRYfiTAT.T.OfiP APWTr atomtp 

STRUCTURAL COORDINATES 
The crystallographic structural coordinates are 
located at the end of the section entitled "Examples" 
and before the claims. Three sets of coordinates can be 
found in the Protein Data Bank under accession names 
1FGK, 1AGW, and 1FGI . The 1FGK coordinates correspond 
to those listed in Table 1, the 1AGW coordinates 
correspond to those listed in Table 4, and the 1FGI 
coodinates correspond to those listed in Table 3 . The 
1AGW and 1FGI coordinate sets will be publically 
available in March 1998. 

Table 1 provides the atomic structure coordinates 
of native FGFR1 crystals of the invention as determined 
by X-ray crystallography; and 

Table 2 provides the atomic structure coordinates 
of FGFR1 : AMP - PCP co-crystals of the invention as 
determined by X-ray crystallography. 

Table 3 lists crystallographic coordinates defining 
the three dimensional structure of FGF-R1 complexed with 
3- [ (3- (2-carboxyethyl) -4-methylpyrrol-5-yl) methylene] -2- 
indolinone. The columns (from left to right) are 
descriptions of the atoms by number and type, amino acid 
and number containing the atom, the x coordinate, y 
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coordinate, z coordinate, bond connectivity, and 
temperature factor. All of zhese parameters are well 
defined in the art. 

Table 4 is a file of crystaliographic coordinates 
5 defining the three dimensional structure of FGF-R1 
complexed with 3 - [4- (4-formylpiperazine-l-yl) 
benzylidenyl] -2 - mdolinone . The columns are as 
described in Table 3 . 

10 DETAILED DESCRIPTION OF TTTC INVENTION 

The present invention is directed to the design and 
identification of modulators of protein tyrosine kinase 
function that are PTK subfamily specific, non- 
hydrolyzable under acidic conditions, and highly 

15 bioavailable . The three dimensional structures of a PTK 
optionally complexed with compounds can facilitate 
design and identification % of modulators of PTK function. 

Protein tyrosine kinases (PTKs) comprise a large 
and diverse class of enzymes. Schlessinger and Ullrich, 

20 1992, Neuron 9: 383-391. The PTK family is subdivided 
into members that are receptors and those that are non- 
receptors. The PTK receptor family contains multiple 
subfamilies, one of which is the fibroblast growth 
factor receptor (FGP-R) PTK which is a molecule 

25 implicated in regulating angiogenesis a well as cellular 
proliferation and differentiation. Givol and Yayon, 
1992, FASEB J. 6 (15): 3362-3369. 

FGF-R1 can mediates cellular functions by its role 
in one or more cellular signal transduction processes. 

30 Cellular signal transduction processes comprise multiple 
steps that convert an extracellular signal into an 
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intracellular signal. 

Receptor PTK mediated signal transduce ion is 
initiated by binding a specific extracellular 1 igand , 
followed by receptor dimerization, and subsequent 

5 autophosphoryiation of the receptor PTK. The phosphate 
groups are binding sites for intracellular signal 
transduction molecules which leads to the formation of 
protein complexes at the cell membrane. These complexes 
facilitate an appropriate cellular effect (e.g., cell 
10 division, metabolic effects to the extracellular 

microenvironment ) in response to the iigand that began 
the cascade of events. 

Receptor PTKs function as binding sites for several 
intracellular proteins. Intracellular PTK binding 

15 proteins are divided into two principal groups: (1) 
those which harbor a catalytic domain; and (2) those 
which lack such a domain but serve as adapters and 
associate with catalytically active molecules. Songyang 
ec al., 1993, Cell 72:767-778. SH2 (sre homology) 

20 domains are common adaptors found in proteins which 
directly bind to the receptor PTK. SH2 domains are 
harbored by PTK binding proteins of both groups 
mentioned above. Pantl et al., 1992, Cell 6$: 413 -423; 
Songyang et al., 1994, Mol. Cell. Biol. 14:2777-2785); 

25 Songyang et al., 1993, Cell 72:767-778; and Koch et al., 
1991, Science 252:668-678. 

The specificity of the interactions between 
receptor PTKs and* the SH2 domains of their binding 
proteins is determined by the amino acid residues 

30 immediately surrounding the phosphorylated tyrosine 

residue. Differences in the binding affinities of SH2 
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domains is correlated with -he observed differences in 
substrate phosphorylation profiles of downstream 
molecules in the signal transduction process. Songyang 
sc ai., 1993, Cell 72:767-778. These observations 
suggest that the function of each receptor PTK is 
determined not only by its pattern of expression and 
ligand availability but also by the array of downstream 
signal transduction pathways that are activated by a 
particular receptor. Thus, PTKs provide a controlling 
regulatory role m signal transduction processes as a 
consequence of autophosphorylation . 

PTK-mediated signal transduction regulates cell 
proliferative, differentiation, and metabolic responses 
in cells. Therefore, inappropriate PTK activity can 
result in a wide array of disorders and diseases. These 
disorders, which are described below, may be treated by 
the modulators of PTK function designed or identified by 
the methods disclosed herein. 

The present invention also relates to crystalline 
polypeptides corresponding to the catalytic domain of 
receptor tyrosine kinases. Such tyrosine kinases 
include receptors of a class that are not covalently 
cross -linked but are understood to undergo ligand- 
induced dimerization, as well as cytoplasmic tyrosine 
kinases. Preferably, the crystalline catalytic domains 
are of sufficient quality to allow for the determination 
of a three-dimensional X-ray diffraction structure to a 
resolution of about 1.5 A to about 2.5 A. The invention 
also relates to methods for preparing and crystallizing 
the polypeptides. The polypeptides themselves, as well 
as information derived from their crystal structures can 
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be used :o analyze and modify tyrosine kinase activity 
as well as to identify compounds that interact with the 
catalytic domain. 

The polypeptides of the invention are designed on 
5 the basis of the structure of a region in the 

cytoplasmic domain of the receptor tyrosine kinase that 
contains the catalytic domain. By way of illustration, 
FIG. 6A shows the amino acid sequence alignment of the 
catalytic domains of eighteen human receptor tyrosine 

10 kinases; one representative member from each of the 
eighteen subfamilies is shown. FIG. 6B shows the 
alignment for cytoplasmic kinases. The applicants have 
discovered and determined the boundari.es of the domain 
required for crystallization of the resulting 

15 polypeptide. Surprisingly, these boundaries differ from 
that required for catalytic activity. For example, 
referring to FIG. 6A, the domain required for catalytic 
activity is generally believed to span about 7 amino 
acid residues upstream of the first glycine (FIG. 6A 

20 residue number 485) of the N -terminal glycine-rich 

region through about 10 residues beyond the C-terminal 
conserved arginine (FIG. 6A, residue number 744) . 
However, the additional sequence upstream of the N- 
terminal glycine-rich region and downstream of the C- 

25 terminal conserved arginine can be required for 

crystallization. In particular, at least about 20 amino 
acid residues (+/- 5 amino acid residues) upstream of 
the first glycine ( i.e. . FIG. 6A, residue number 485) in 
the conserved glycine-rich region of the catalytic 

30 domain, and at least about 17 amino acid residues (+/- 5 
amino acid residues) downstream of the conserved 
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arginine ( i.e. , FIG. 6A, residue number 744) iocaced at 
the C- terminal boundary of the catalytic domain can be 
required to engineer a polypeptide suitable for 
crystallization . 
5 In those situations where the resulting polypeptide 

contains cysteine residues that interfere with 
crystallization ( e.g. , cysteine residue numbers 438 and 
534 in the FGF-R1 sequence shown in FIG. 6A) , such 
cysteine residues can be substituted with an appropriate 

10 amino acid that does not readily form covalent bonds 
with other amino acid residues under crystallization 
conditions; e.g. . by substituting the cysteine with Ala, 
Ser or Gly. Any cysteine located in a non-helical or 
non- 3 -stranded segment, based on secondary structure 

15 assignments, are good candidates for replacement. For 
example, cysteines located in regions corresponding to 
the glycine-rich-loop, the kinase insert, the 
juxtamembrane region or the activation loop are prime 
candidates for replacement. However, substitutions of 

20 cysteine residues that are conserved among the kinases 
(e.g. . FIG. 6A at positions 725 and 736) are preferably 
avoided. 

I. PTX Associated Diseases 

25 Blood vessel proliferative disorders refer to 

angiogenic and vasculogenic disorders generally 
resulting in abnormal proliferation of blood vessels. 
The formation and spreading of blood vessels play 
important roles in a variety of physiological processes 

3 0 such as embryonic development, corpus luteum formation, 
wound healing and organ regeneration. They also play a 
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pivotal role ir. cancer development. Other examples of 
blood vessel proliferation disorders include arthritis, 
where new capillary blood vessels invade the joint and 
destroy cartilage, and ocular diseases, like diabetic 
5 retinopathy, where new capillaries in the retina invade 
the vitreous, bleed and cause blindness. Conversely, 
disorders related to the shrinkage, contraction or 
closing of blood vessels are implicated in such diseases 
as restenosis. 

10 Fibrotic disorders refer to the abnormal formation 

of extracellular matrix. Examples of fibrotic disorders 
include hepatic cirrhosis and mesangial cell 
proliferative disorders. Hepatic cirrhosis is 
characterized by the increase in extracellular matrix 

15 . constituents resulting in the formation of a hepatic 
• scar. Hepatic cirrhosis can cause diseases such as 
cirrhosis of the liver. An increased extracellular 
matrix resulting in a hepatic scar can also be caused by 
viral infection such as hepatitis. 

20 Mesangial cell proliferative disorders refer to 

disorders brought about by abnormal proliferation of 
mesangial cells. Mesangial proliferative disorders 
include various human renal diseases, such as 
glomerulonephritis, diabetic nephropathy, malignant 

25 nephrosclerosis, thrombotic microangiopathy syndromes, 
transplant rejection, and glomerulopathies. The PDGF-R 
has been implicated in the maintenance of mesangial cell 
proliferation. Floege et al . , 1993, Kidney 
International £1:47S-54S. 

30 PTKs are directly associated with the cell 

proliferative disorders described above. For example, 
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some members of "he receptor ?TK family have been 
associated with the development of cancer. Some of 
these receptors, like EGFR (Tuzi ec al . , 1991, Br. J. 
Cancer 63:221 -233 ; Torp ec al . , 1992, APMIS 100:112- 
5 719) HER2 /neu (Slamon et al . , 1939, Science 244:707-712) 
and PDGF-R (Kumabe et al . , 1992, Oncogene 7:627-633) are 
over-expressed in many tumors and/or persistently 
activated by autocrine loops. In fact, PTK over- 
expression (Akbasak and Suner-Akbasak et al . , 1992, J. 
10 Neurol. Sci . 111:119-133; Dickson et al . , 1992, Cancer 
Treatment Res. £1:249-273; Korc et al . , 1992, J. Clin. 
Invest. 90:1352-1360) and autocrine loop stimulation 
(Lee and Donoghue, 1992, J. Cell. Biol. 118:1057-1070; 
Korc et al., supra; Akbasak and Suner-Akbasak et al . , 
15 supra) account for the most common and severe cancers. 
For example, EGFR is associated with squamous cell 
carcinoma, astrocytoma, glioblastoma, head and neck 
cancer, lung cancer and bladder cancer. HER2 is 
associated with breast, ovarian, gastric, lung, pancreas 
20 and bladder cancer. PDGF-R is associated with 

glioblastoma, lung, ovarian, melanoma and prostate 
cancer. The receptor PTK c-met is generally associated 
with hepatocarcinogenesis and thus hepatocellular 
carcinoma. Additionally, c-met is linked to malignant 
25 tumor formation. More specifically, c-met has been 
associated with, among other cancers, colorectal, 
thyroid, pancreatic and gastric carcinoma, leukemia and 
lymphoma. Additionally, over-expression of the c-met 
gene has been detected in patients with Hodgkins 
30 disease, Burkitts disease, and the lymphoma cell line. 
The IGF- I receptor PTK, in addition to being 
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impllCaCSd in ^^onal support and - .ype-n 
aiabsces. is also associated with several types o<= 
cancers. For example, IGF- I has bee, implicated as an 
autocrine growth stimulator for several CU3lor CVDes , 
e.j. human breast cancer carcinoma cells (Arteaga et 
al-. 1989. J. ciin. Invest. S4 : 1418 -1423) and small lure 
tumor cells (Macauiey eC al . , i990 , c^cer SeS . 50.-2511- 
2517). m addition, IGF-I, integrally involved in the 
normal growth and differentiation of the nervous system 
appears to be an autocrine stimulator of human gliomas. 
Sandberg-Nordqvist et al . , 1993, Cancer Res . 53:2475- 
2478. The importance of the IGF-IR and its modulators 
in cell proliferation is further supported by the fact " 
that many cell types in culture (fibroblasts, epithelial 
cells, smooth muscle cells, T- lymphocytes, myeloid 
cells, chondrocytes, osteoblasts, the stem cells of the 
bone marrow) are stimulated to grow by IGF-I. Goldring 
and Goldring, i 991 , EuJcaryotic Gene Expression 1:301- 
326. m a series of recent publications suggest that 
IGF-IR plays a central role in the mechanisms of 
transformation and, as such, could be a preferred target 
for therapeutic interventions for a broad spectrum of 
human malignancies. Baserga, 1 995 , Cancer Res. 55:249- 
252; Baserga, 1994, Cell 79:927-930; Coppola et al . , 
1994, Mol. Cell. Biol. 24:4588-4595. 

The association between abnormalities- in receptor 
PTKs and disease are not restricted to cancer, however. 
For example, receptor PTKs are associated with metabolic 
diseases like psoriasis, diabetes mellitrus, wound 
healing, inflammation, and neurodegenerative diseases. 
EGF-R is indicated in corneal and dermal wound healing. 



WO 98/07S35 PCI7US97/ 14885 



Defects in Insulin-?, and IGF - IR are indicated in type- II 
diabetes mellitus. A more complete correlation between 
specific receptor PTKs and their therapeutic indications 
is set forth m Plowman et al . , 1994, DN&P 7:334-339. 
5 Non-receptor PTKs , including src, abl, fps, yes, 

£yn, lyn, lck, blk, hck, fgr, yrk (reviewed by Bolen et 
al . , 1992, FASEB J. 6 : 3403 - 3409 ) , are involved in the 
proliferative and metabolic signal transduction pathways 
also associated with receptor PTKs. Therefore, the 

10 present invention is also directed towards designing 
modulators against this class of PTKs. For example, 
mutated src (v-src) is an oncoprotein (pp60 v " src ) in 
chicken. Moreover, its cellular homolog, the proto- 
oncogene pp€0 c " arc transmits oncogenic signals of many 

15 receptors. For example, over-expression of EGF-R or 

HER2/neu in tumors leads to the constitutive activation 
of pp60 c ~ src , which is characteristic of the malignant 
cell but absent in the normal cell. On the other hand, 
mice deficient for the expression of c-src exhibit an 

20 osteopetrotic phenotype, indicating a key participation 
of c-src in osteoclast function and a possible 
involvement in related disorders. Similarly, Zap 70 is 
implicated in T-cell signaling. Both receptor PTKs and 
non- receptor PTKs are connected to hyperimmune 

25 disorders. 

The instant invention is directed in part towards 
designing modulators of PTK function that, could 
indirectly kill tumors by cutting off their source of 
sustenance. Normal vasculogenesis and angiogenesis play 

3 0 important roles in a variety of physiological processes 
such as embryonic development, wound healing, organ 
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regeneration and female reproductive processes such as 
follicle development in the corpus iuteum during 
ovulation and placental growth after pregnancy. Foixman 
and Shmg, 1992, J. 3iological Chem. 267 ■. 10331 -34 . 
However, -any diseases are driven by persistent 
unregulated or inappropriate angiogenesis . For example, 
in arthritis, new capillary blood vessels invade the 
joint and destroy the cartilage. In diabetes, new 
capillaries in the retina invade the vitreous, bleed and 
cause blindness. Folkman, 1987, in: Congress of 
Thrombosis and Haemostasis (Verstraete, ec. a 1 , eds . ) , 
Leuven University Press, Leuven, pp. 583 -596. Ocular 
neovascularization is the most common cause of blindness 
and dominates approximately twenty (20) eye diseases. 

Moreover, vasculogenesis and/or angiogenesis can be 
associated with the growth of malignant solid tumors and 
metastasis. A tumor must continuously stimulate the 
growth of new capillary blood vessels for the tumor 
itself to grow. Furthermore, the new blood vessels 
embedded in a tumor provide a gateway for tumor cells to 
enter the circulation and to metastasize to distant 
sites in the body. Folkman, 1990, J. Natl. Cancer Inst. 
82:4-6; Klagsbrunn and Soker, 1993, Current Biology 
3:699-702; Folkman, 1991, J. Natl., Cancer Inst. 52:4-6; 
Weidner et al., 1991, New Engl. J. Med. 324:1-5. 

Several polypeptides with in vitro endothelial cell 
growth promoting activity have been identified. 
Examples include acidic and basic fibroblastic growth 
factor (aFGF, pFGF) , vascular endothelial growth factor 
(VEGF) and placental growth factor. Unlike aFGF and 
(JFGF, VEGF has recently been reported to be an 
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endothelial ceil specific mitogen. Ferrara and Henzel, 
1939, Biochem. Biophys. Res, Comm. 161 : 351 - 353 ; Vaisman 
ec al . , 1990, J. Biol. Chem. 265:19461-19566. 

Thus, identifying the specific receptors that bind 
FG? or VEGF is important for understanding endothelial 
ceil proliferation regulation. Two structurally related 
receptor PTKs that bind VEGF with high affinity are 
identified: the flt-1 receptor (Shibuya et al . , 1990, 
Oncogene 5:519-524; De Vries et al . , 1992, Science 
255:939-991) and the KDR/FLK-1 "receptor, discussed in 
the U.S. Patent Application No. 08/193,829. In 
addition, a receptor that binds aFGF and (SFGF is 
identified. Jaye et al . , 1992, Biochem. Biophys . Acta 
1135:185-199). Consequently, these receptor PTKs most 
likely regulate endothelial cell proliferation. 

FGFRs play important roles in angiogenesis , wound 
healing, embryonic development, and malignant 
transformation. Basilico and Moscatelli, 1992, Adv. 
Cancer Res. 59:115-165. Four mammalian FGFR (FGFR1-4) 
have been described and additional diversity Is 
generated by alternative RNA splicing within the 
extracellular domains. Jaye et aJ . , 1992, Biochem. 
Biophys. Acta 1135:185-199. Like other receptor PTKs, 
dimerization of FGF receptors is essential for their 
activation. Soluble or cell surface-bound heparin 
sulfate proteoglycans act in concert with FGF to induce 
dimerization (Schlessinger et ai., 1995, Cell 53:357- 
360) , which leads to autophosphorylation of specific 
tyrosine residues in the cytoplasmic domain. Mohammadi 
et al., 1996, Mol. Cell Biol. 1^:977-989. 

Mutations in three human FGF receptor genes, FGFR1 
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FG7R2, and FG7R3 , have be<=>- ; -n];-*~^ • 

--P^-av-ed in a variety of 

. »Kei_ca. aisoraers. Mucacions in ?r. ?R : 

and FGFR2 result in ,;, 3 premature fusion of the fla -_" 
bcr.es of che skull and cause che crmiosyno.cosis 
syndromes, such as Apert ( FGFR2 ) (wUkie et ai . , 1994. 
Nac. Genec . S:269 . 274)/ Pfsiffer (?GFR1 ?g?r2) 

(Muenke et a i . , 1994, Nat . Genet. 8:269-274), 
Jackson-Weiss (FGFR2) ( Jabs et al . , 1994, Nat. Genet. 
5:275-279) and Crouzon (FGFR2) (Jabs et al . , 1994, Nac 
Genet. 8:275-279) syndromes. .In contrast mutations in 
FGFR3 are implicated in long bone disorders and cause 
several clinically related forms of dwarfism including 
achondroplasia (Shiang et al . , 1994, Cell 78 : 335- 342 ) , 
hypochondroplasia (Bellus et al . , 1995, Nat. Genet. 
10:357-359) and the neonatal lethal thanatophoric 
dysplasia (Tavormina et al . , 1995, Nat. Genet. 5:321- 
328). It has been shown that these mutations lead to 
constitutive activation of the tyrosine kinase activity 
of FGFR3 (Webster et al . , 1996, EMBO J. 25:520-527). 
Furthermore gene -targeting experiments in mice have 
revealed an essential role for FGFR3 in developmental 
bone formation (Deng et al., 199s, cell 54:911-921). 

Another major role proposed for FGFs in vivo is the 
induction of angiogenesis (Folkman and Klagsbrun, 1987, 
Science 2J*:442) . . Therefore, inappropriate expression 
of FGFs or of their receptors or aberrant function of 
the tyrosine kinase activity could contribute to several 
human angiogenic pathologies such as diabetic 
retinopathy, rheumatoid arthritis, atherosclerosis and 
tumor neovascularization (Klagsbrun and Edelman, 1989, 
Arteriosclerosis 9:269). Moreover, FGFs are thought to 
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be involved in malignant z ransf ornacion . Indeed, the 
genes coding for the three FGF homologues int-2, FGF- 5 
and hst-l/K-fgf were originally isolated as oncogenes. 
Furthermore, the cDNA encoding FGFR1 and FGFR2 are 
amplified in a population of breast cancers (Adnane et 
al., 1991, Oncogene 6:G59-6G2) . Over-expression of FGF 
receptors has been also detected in human pancreatic 
cancers, astrocytomas, salivary gland adenosarcomas, 
Kaposi sarcomas, ovarian cancers and prostate cancers. 

Evidence, such as the disclosure set forth in 
copending U.S. Application Serial No. 08/153,829, 
strongly suggests that VEGF is not only responsible for 
endothelial cell proliferation, but also is a prime 
regulator of normal and pathological angiogenesis . See 
generally, Klagsburn and Soker, 1993, Current Biology 
3:699-702; Houck et al . , 1992, J. Biol. Chem. 
257:26031-26037. Moreover, it has been shown that 
KDR/FLK- 1 and flt-1 are abundantly expressed in the 
proliferating endothelial cells of a growing tumor, but 
not in the surrounding quiescent endothelial cells. 
Plate et al . , 1992, Nature 359:845-848; Shweiki et al . , 
1992, Nature 359:843-845. 

The invention is directed to designing and 
identifying modulators of receptor and non-receptor PTK 
functions that could modify the inappropriate activity 
of a PTK involved with a clinical disorder. The 
rational design and identification of modulators of PTK 
functions can be accomplished by utilizing the 
structural coordinates that define a PTK three 
dimensional structure. 
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I! . ModuUt^rs of PTX functions as Therapeutics fnr 
Disease 

As a consequence of the disorders discussed above, 
scientists in the biomedical community are searching for 
modulators of PTK functions that down- regulate signal 
transduction pathways associated with inappropriate ?TK 
activity . 

In particular, small molecule modulators of PTK 
functions are sought as some can traverse the cell 
membrane and do not hydrolyze in acidic environments. 
Some compounds have already been discovered. For 
example, bis monocyclic, bicyclic or heterocyclic aryl 
compounds (PCT WO 92/20642) , vinylene-azaindole 
derivatives (PCT WO 94/14 808) l-cyclopropyl-4-pyridyl- 
quinolones (U.S. Patent No. 5,330,992), styryl compounds 
(U.S. Patent No. 5,217,999), styryl-substituted pyridyl 
compounds (U.S. Patent No. 5,302,606), certain 
quinazoline derivatives (EP Application No. 0 566 266 
Al), seleoindoles and selenides (PCT WO 94/03427), 
tricyclic polyhydroxylic compounds (PCT WO 92/21660), 
and benzylphosphonic acid compounds (PCT WO 91/15495) 
are described as PTK inhibitors. 

Although some modulators of PTK function are known, 
many of these are not specific for PTK subfamilies and 
will therefore cause multiple side-effects as 
therapeutics. Compounds of the oxindolinone/ 
thiolindolinone family, however, are specific for the 
FGF receptor subfamily (U.S. Patent Application Serial 
No. 08/702,232, filed August 23, 1996, invented by Tang 
et al., entitled "Indolinone Combinatorial Libraries and 
Related Products and Methods for the Treatment of 
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Disease," Attorney Docket Mo. 221/137). In addition, 
compounds of the oxindolinone/thiolindolinone family are 
non-hydroiyzable in acidic conditions and can be highly 
bioavailabie . 

The invention provides information regarding the 
specific interactions between a PTK and compounds of the 
oxindolinone/thiolindolinone family. Although the use 
of X-ray crystallography has provided three dimensional 
structures of other PTKs , the PTKs in these structures 
are not compiexed with PTK subfamily specific, 
hydrolysis resistant, highly bioavailabie small 
molecules. The X-ray crystallography techniques used in 
the current invention resolve interactions between a PTK 
and compounds in complex with it at the atomic level, 
which provides detailed information regarding the 
orientation of chemical groups defining an effective 
modulator of PTK function. 

Ill , rryatalline Tyrosine Kinases 

Crystalline PTKs of the invention include native 
crystals, derivative crystals and co-crystals. The 
native crystals of the invention generally comprise 
substantially pure polypeptides corresponding to the 
tyrosine kinase domain in crystalline form. 

It is to be understood that the crystalline 
tyrosine kinase domains of the invention are not limited 
to naturally occurring or native tyrosine kinase 
domains. Indeed, the crystals of the invention include 
mutants of native tyrosine kinase domains. Mutants of 
native tyrosine kinase domains are obtained by replacing 
at least one amino acid residue in a native tyrosine 
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kinase domain with a different amino acid residue, or by 
adding or deleting amino acid residues within the native 
polypeptide or at the N- or C- terminus of the native 
polypeptide, and have substantially the same three - 
5 dimensional structure as the native tyrosine kinase 
domain from which the mutant is derived. 

By having substantially the same three-dimensional 
structure is meant having a sec of acomic scructure 
coordinates that have a root -mean-square deviation of 

10 less than or equal to about 2A when superimposed with 

the atomic structure coordinates of Che native tyrosine 
kinase domain from which the mutant is derived when at 
least about 50% to 100% of the Cot atoms of the native 
tyrosine kinase domain are included in the 

15 superposition. 

Amino acid substitutions, deletions and additions 
which do not significantly interfere with the three- 
dimensional structure of the tyrosine kinase domain will 
depend, in part, on the region of the tyrosine kinase 

20 domain where the substitution, addition or deletion 
occurs. In highly variable regions of the molecule, 
such as those shown in FIG. 6, non- conservative 
substitutions as well as conservative substitutions may 
be tolerated without significantly disrupting the three- 

25 dimensional structure of the molecule. In highly 

conserved regions, or regions containing significant 
secondary structure, such as those regions shown in FIG. 
6, conservative amino acid substitutions are preferred. 
Conservative amino acid substitutions are well- 

30 known in the art, and include substi tut ions 'made on the 
basis of similarity in polarity, charge, solubility, 
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hydrcphobicity , hydrcphilicicy and/or che amphipathic 
na:ure of che amino acid residues involved. For 
example , negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids 
include lysine and arginine; amino acids with uncharged 
polar head groups having similar hydrophiiicity values 
include the following: leucine, isoieucine, valine; 
glycine, alanine; asparagine, glutamine; serine, 
threonine; phenylalanine, tyrosine. Other conservative 
amino acid substitutions are well known in the art. 

For tyrosine kinase domains obtained in whole or in 
part by chemical synthesis, the selection of amino acids 
available for substitution or addition is not limited co 
the genetically encoded amino acids. Indeed, the 
mutants described herein may contain non-genetically 
encoded amino acids. Conservative amino acid 
substitutions for many of the commonly known non- 
genetically encoded amino acids are well known in the 
art. Conservative substi tut ions for other amino acids 
can be determined based on their physical properties as 
compared to the properties of the genetically encoded 
amino acids . 

In some instances, it may be particularly 
advantageous or convenient to substitute, delete and/or 
add amino acid residues to a native tyrosine kinase 
domain in order to provide convenient cloning sites in 
cDNA encoding the polypeptide, to aid in purification of 
the polypeptide, and for crystallization of the 
polypeptide. Such substitutions, deletions and/or 
additions which do not substantially alter the three 
dimensional structure of the native tyrosine kinase 
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domain will be apparent tc -.hose of ordinary skill m 
~hs art . 

It should be noted that the mutants contemplated 
herein need not exhibit ?TK activity. indeed, amino 
acid substitutions, additions or deletions that 
interfere with the kinase activity of the tyrosine 
kinase domain but which do not significantly alter the 
three-dimensional structure of the domain are 
specifically contemplated by the invention. Such 
crystalline polypeptides, or the atomic structure 
coordinates obtained therefrom, can be used to identify 
compounds that bind to the native domain. These 
compounds may affect the activity or the native domain. 

The derivative crystals of the invention generally 
comprise a crystalline tyrosine kinase domain 
polypeptide in covalent association with one or more 
heavy metal atoms. The polypeptide may correspond to a 
native or a mutated tyrosine kinase domain. Heavy metal 
atoms useful for providing derivative crystals include, 
by way of example and not limitation, gold, mercury, 
etc. 

The co-crystals of the invention generally comprise 
a crystalline tyrosine kinase domain polypeptide in 
association with one or more compounds. The association 
may be covalent or non- covalent . Such compounds 
include, but are not limited to, cof actors, substrates, 
substrate analogues, inhibitors, allosteric effectors, 
etc. 
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IV • Three Dimensional. Structure De^g^ - p.acion 'Jsir.a X- 

ray Crystallography 

X-ray crystallography is a method of solving the 
three dimensional structures of molecules. The 
5 structure of a molecule is calculated from X-ray 

diffraction patterns using a crystal as a diffraction 
grating. Three dimensional structures of protein 
molecules arise from crystals grown from a concentrated 
aqueous solution of that protein. The process of X-ray 
10 crystallography can include the following steps: 

(a) synthesizing and isolating a polypeptide; 

(b) growing a crystal from an aqueous solution 
comprising the polypeptide with or without a 
modulator; and 

(c) collecting X-ray diffraction patterns from the 
crystals, determining unit cell dimensions and 
symmetry, determining electron density, 
fitting the amino acid sequence of the 
polypeptide to the electron density, and 
refining the structure. 

Production of Polypeptides 

The native and mutated tyrosine kinase domain 

25 polypeptides described herein may be chemically 

synthesized in whole or part using techniques that are 
well-known in the art ( see . e. g .. Creighton, 1983). 
Alternatively, methods which are well known to those 
skilled in the art can be used to construct expression 

3 0 vectors containing the native or mutated tyrosine kinase 
domain polypeptide coding sequence and appropriate 
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20 
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transcripcional/transiarionai control signals. These 
methods include in vitro recombinant DNA techniques, 
synthetic techniques and in vivo recombinat ion/aenet ic 
recombination. See, for example, the techniques 
described in Maniatis et al . , 1989 and Ausubel et al . , 
1939. 

A variety of host -expression vector systems may be 
utilized to express the tyrosine kinase domain coding 
sequence. These include but are not limited to 
microorganisms such as bacteria transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors containing the tyrosine kinase domain 
coding sequence; yeast transformed with recombinant 
yeast expression vectors containing the tyrosine kinase 
domain coding sequence; insect cell systems infected 
with recombinant virus expression vectors ( e.g. . 
baculovirus) containing the tyrosine kinase domain 
coding sequence; plant cell systems infected with 
recombinant virus expression vectors (e.g. . cauliflower 
mosaic virus, CaMV; tobacco mosaic virus, TMV) or 
transformed with recombinant plasmid expression vectors 
(e.g. » Ti plasmid) containing the tyrosine kinase domain 
coding sequence; or animal cell systems. The expression 
elements of these systems vary in their strength and 
specificities. 

Depending on the host/vector system utilized, any 
of a number of suitable transcription and translation 
elements, including constitutive and inducible 
promoters, may be used in the expression vector. For 
example, when cloning in bacterial systems, inducible 
promoters such as pL of bacteriophage X, plac, ptrp, 
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c:ac ! ocrc - iac nyc rid promoter) and the like may be 
used; when cloning in insect cell systems, promoters 
such as the caculovirus polyhedrm promoter may be used; 
when cloning m plane cell systems, promoters derived 
5 from the genome of plant cells (e.g. . heat shock 
promoters; the promoter for the small subunit of 
RUB1SCO; the promoter for the chlorophyll a/b binding 
protein) or from plant viruses (e.g. . the 35S RNA 
cromoter of CaMV; the coat protein promoter of TMV) may 

10 be used; when cloning in mammalian cell systems, 

promoters derived from the genome of mammalian cells 
( e.g. , metallothionein promoter) or from mammalian 
viruses ( e.g. , the adenovirus late promoter; the 
vaccinia virus 7 . 5K promoter) may be used; when 

15 generating cell lines that contain multiple copies of 
the tyrosine kinase domain DNA, SV40-, BPV- and EBV- 
based vectors may be used with an appropriate selectable 
marker . 

Methods describing methods of DNA manipulation, 
20 vectors, various types of cells used, methods of 

incorporating the vectors into the cells, expression 
techniques, protein purification and isolation methods, 
and protein concentration methods are disclosed in 
detail with respect to the protein PYK-2 in PCT 
25 publication WO 96/18738. This publication is 

incorporated herein by reference in its entirety, 
including any drawings. Those skilled in. the art will 
appreciate that such descriptions are applicable to the 
present invention and can be easily adapted to it. 
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Crystal Growth 

Crystals are grown from an aqueous solution 
containing the purified ar.d concentrated polypeptide by 
a variety of techniques. These techniques include 
5 batch, liquid, bridge, dialysis, vapor diffusion, and 

hanging drop methods. Mcpherson, 1982, John Wiley, New 
York; McPherson, 1990, Eur. J. Biochem. 189:1-23; 
Webber, 1991, Adv. Protein Chem. 41:1-36, incorporated 
^y reference herein in its entirety, including all 

10 . figures, tables, and drawings. 

Generally, the native crystals of the invention are 
grcwn by adding precipitants to the concentrated 
sol" :t ion of the polypeptide corresponding to the PTK 
catalytic domain. The precipitants are added at a 

15 concentration just below that necessary to precipitate 

the protein. Water is removed by controlled evaporation 
to proiuce precipitating conditions, which are 
maintained until crystal growth ceases. 

For crystals of the invention, it has been found 

20 that hanging drops containing about 2.0 txL of tyrosine 

kinase domain polypeptide (10 mg/mL in lOmM Tris-HCl, pH 
8.0 # 10 mM NaCl and 2 mM dithiothreitol) and 2.0 
reservoir solution (16% w/v polyethylene glycol MW 
10000, 0.3 M (NH 4 ) 2 SO«, 5% v/v ethylene glycol or 

25 glycerol and 100 mM bis-Tris, pH 6.5) suspended over 0.5 
mL reservoir buffer for about 3-4 weeks at 4°C provide 
crystals suitable for high resolution X-ray structure 
determination. 

Those of ordinary skill in the art will recognize 

30 that the above -described crystallization conditions can 
be varied. Such variations may be used alone or in 
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combinat ion , and include polypeptide solutions 
containing polypeptide cor.cen:ra:ior.5 between about 1 
mg/mL and about 60 rng/ml, Tns-HCl concentrations 
between about 10 mM and aoout 200 mM, dit hiot hrei tol 
5 concentrations between about 0 mM and about 20 mM, pH 
ranges between about 5.5 and about 7.5; and reservoir 
solutions containing polyethylene glycol concentrations 
between about 10% and about 30% (w/v) , polyethylene 
glycol molecular weights between about 1000 and about 

10 20,000, (NK<) 2 S0 4 concentrations between about 0.1 M a*.d 
about 0.5 M, ethylene glycol or glycerol concentrator ns 
between about 0% and about 20% (v/v) , bis-Tris 
concentrations between about 10 mM and about 200 mM pH 
ranges between about 5.5 and about 7.5 and tempera- ure 

15 ranges between about 0° C and about 25°C. Other fc .ffer 
solutions may be used such as HEPES buffer, so ic ig as 
the desired pH range is maintained. 

Derivative crystals of the invention can bf 
obtained by soaking native crystals in mother l.quor 

20 containing salts of heavy metal atoms. It has been 
found that soaking a native crystal in a solution 
containing about 0 . 1 mM to about 5 mM thimercsal, 4- 
chloromeruribenzoic acid or KAu(CN) 2 for abo; t 2 hr to 
about 72 hr provides derivative crystals su table for 

25 use as isomorphous replacements in determinng the X-ray 
crystal structure of the tyrosine kinase domain 
polypeptide. 

Co-crystals of the invention can be obtained by 
soaking a native crystal in mother liqucr containing 
3 0 compound that bind the kinase domain, or described 

above, or can be obtained by co-crysta..lizing the kinase 
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domain pclypeocide in -ho 0 v. a _ 0 _ 0 ^ 

' P*-sence or one or -ore 

binding compounds. 

For co-crystals of tyrosine kinase domain 
polypeptide in co-co*plex with AMP-pc?, it has been 
found that co-crystaiiizing the kinase domain 
polypeptide in the presence of amp-pcp using the above . 
described crystallization conditions for obtaining 
native crystals with a polypeptide solution additionally 
containing 10 mM AMP-PCP and 20 mM MgCl 2 yields co- 
crystals suitable for the high resolution structure 
determination by X-ray crystallography. Of course 
chose having skill in the art will recognize that the 
concentrations of AMP-PCP and MgCl 2 in the polypeptide 
solution can be varied, alone or in combination with the 
variations described above for native crystals. Such 
variations include polypeptide solutions containing AMP- 
PCP concentrations between p.l mM and 50 mM and MgCl 2 
concentrations between 0 mM and 50 mM. 

Crystals comprising a polypeptide corresponding to 
a PTK catalytic domain complexed with a compound can be " 
grown by one of two methods, m the first method, the 
modulator is added to the aqueous solution containing 
the polypeptide corresponding to the PTK catalytic 
domain before the crystal is grown, m the second 
method, the modulator is soaked into an already existing 
crystal of a polypeptide corresponding to a PTK 
catalytic domain. 
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Crystalline -r,FR 

In one illustrative embodiment, the invention 
provides crystals of FGFRI . The crystals were obtained 
by the methods provided in the Examples. The FGFR1 
5 crystals, which may be native crystals, derivative 
crystals or co-crystals, have monoclinic unit cells 
( i.e. . unit cells wherein a^b^c; a=y=90°; and 3>90°) and 
space group symmetry C2 . There are two FGFR1 molecules 
m the asymmetric unit, related by an approximate two- 
10 fold axis. 

Two forms of crystalline FGFR1 were obtained. In 
one form (designated "C2-A form"), the unit cell has 
dimensions of a=208.3 A, b=57.2 A, c=65.5 A and 
3 = 107.2°. In another form (designated "C2-B form' 1 ), the 
15 unit cell has dimensions of a=211.6 A, b=51.3 A, c=66.I 
A and 3=107.7°. 

Three distinct two- fold related FGFR1 dimers are 
observed in both the C2-A and C2-B forms of the FGFR1 
crystal; one non-crystallographically related dimer and 
20 two crystallographically related dimers. The non- 
crystallographically related dimer comprises the two 
molecules; in the asymmetric unit. The residues making 
up the dimer interface are located in C-terminal lobe. 
In this dimer, the C-terminal lobes abut with the N- 

2 5 terminal lobes distal to one another. The total amount 

of surface area buried in the surface is about 950 A 2 . 
Very few of the interactions in the interface are of a 
specific nature, e.g.. hydrogen -bonding or close packing 
of hydrophobic- residues . 

3 0 There are two crystallographically-related dimers 

in the C2 lattice. In the first dimer, the residues 
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chat constitute -he dime- ; -r-*-f a ~ , . 

" iac - are limited co chose 

in che 3-sheet of che N-cermiaal lobe (amino acid 
residues 477, 4 7 5 , 4 98, 506, 503 and 496). The total 
surface area buried ir. this interface i, about 670 
The interactions are rather specific. Three hydrophobic 
residues which are partially solvent -exposed in che 
monomer, Vai-479, ii e - 4 93 and Val-508, come together 
wich their two-fold-related residues to form a compact 
hydrophobic plug. This plug is capped on either side by 
a salt bridge between Arg-477 and Glu-496. m addition, 
two main-chain hydrogen- bonds connect the (J-sheets of 
the two monomers at the start of 33 (amino acid residues 
506 and 508) . The residues in this dimer interface, or 
their residue character, are generally conserved in the 
mammalian FGF receptors, but not in the invertebrate 
homologues . 

The other crystallographically- related dimer buries 
about 1650 A 2 in its interface. In this dimer, the aC 
helices of the two monomers are nearly parallel and 
contact each other at their C- terminal ends. Met-534 
and Met-537 are in van der Waals contact with their two- 
fold-related residues, other hydrophobic contacts 
involve Pro-466 with Ue-648 and Pro-469 with Ile-676 
and Thr-678. m addition, hydrogen bonds (side-chain to 
main-chain) are made between Arg-470 and Lys-618 and 
between His-649 and Glu-464, and there. are several water 
molecules that bridge the two monomers through hydrogen 
bonding , 

In the C2-B form of the crystal, the monomers of 
this second crystallographically-related dimer are 
shifted slightly with respect to one another (6- 
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rotation) , indicating that this interlace is somewhat 
fluid. 

In both of the crystallographically- related diners, 
the N-termini of the two molecules comprising the diner 
point in the same direction and are reasonably close to 
one another. 

Determining Unit Cell Dimensions and the Three 
nim^nsio n al Structure of a Polypeptide QZ Polypept i de 
Complex 

Once the crystal is grown , it can be placed in a 
glass capillary tube and mounted onto a holding device 
connected to an X-ray generator and an X-ray detection 
device. Collection of X-ray diffraction patterns are 
well documented by those in the art. Ducruix and Geige, 
1992, IRL Press, Oxford, England, and references cited 
therein. A beam of X-rays enter the crystal and then 
diffract from the crystal. An X-ray detection device 
can be utilized to record the diffraction patterns 
emanating from the crystal. Although the X-ray 
detection device on older models of these instruments is 
a piece of film, modern instruments digitally record X- 
ray diffraction scattering. 

Methods for obtaining the three dimensional 
structure of the crystalline form of a peptide molecule 
or molecule complex are well known in the art. Ducruix 
and Geige, 1992, IRL Press, Oxford, England, and 
references cited therein. The following are steps in 
the process of determining the three dimensional 
structure of a molecule or complex from X-ray 
diffraction data. 
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After the X-ray diffraction patterns are collected 
from the crystal, the unit cell dimensions ana 
orientation m the crystal can be determined. Thev can 
oe ae-errr.ir.ed from the spacing between the diffraction 
emissions as well as the patterns made from these 
emissions. The unit ceil dimensions are characterized 
in three dimensions in units of Angstroms (one A = io- lc 
meters) and by angles at each vertices. The symmetry of 
the unit cell in the crystals is also characterized at 
this stage. The symmetry of the unit cell in the 
crystal simplifies the complexity of the collected data 
by identifying repeating patterns. Application of the 
symmetry and dimensions of the unit cell is described 
below. 

Each diffraction pattern emission is characterized 
as a vector and the data collected at this stage of the 
method determines the amplitude of each vector. The 
phases of the vectors can be determined using multiple 
techniques. In one method, heavy atoms can be soaked 
into a crystal, a method called isomorphous replacement, 
and the phases of the vectors can be determined by using 
these heavy atoms as reference points in the X-ray 
analysis. Otwinowski, 1991, Daresbury, United Kingdom, 
80-86. The isomorphous replacement method usually 
requires more than one heavy atom derivative. In 
another method, the amplitudes and phases of vectors 
from a crystalline polypeptide with an already 
determined structure can be applied to the amplitudes of 
the vectors from a crystalline polypeptide of unknown 
structure and consequently determine the phases of these 
vectors. This second method is known as molecular 
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replacement and the protein structure which is used as a 
reference must have a close!'/ related structure to the 
orotem of interest. Naraza, 1994, Proteins 11:281-296. 
Thus, the vector information from a PTK of known 
structure, such as those reported herein, are useful for 
the molecular replacement analysis of another PTK with 
unknown structure . 

Once the phases of the vectors describing the unit 
ceil of a crystal are determined, the vector amplitudes 
and phases, unit cell dimensions, and unit ceil symmetry 
can be used as terms in a Fourier transform function. 
The -Fourier transform function calculates the electron 
density in the unit cell from these measurements. The 
electron density that describes one of the molecules or 
one of the molecule complexes in the unit cell can be 
referred to as an electron density map. The amino acid 
structures of the sequence or the molecular structures 
of compounds complexed with the crystalline polypeptide 
may then fit to the electron density using a variety of 
computer programs. This step of the process is 
sometimes referred to as model building and can be 
accomplished by using computer programs such as 
TOM/FRODO. Jones, 1985, Methods in Enzymology 225:157- 
171. 

A theoretical electron density map can then be 
calculated from the amino acid structures fit to the 
experimentally determined electron density. The 
theoretical and experimental electron density maps can 
be compared to one another and the agreement between 
these two maps can be described by a parameter called an 
R-f actor. A low value for an R-f actor describes a high 
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decree of overlapping electron density between a 
theoretical and experimental electron density map. 

The R- factor is then minimized by using computer 
programs that refine the theoretical electron densitv 
5 map. A computer program such as X-PLOR can be used for 
model refinement by those skilled in the art. Brunger, 
1992, Nature 355:472-475. Refinement may be achieved in 
an iterative process. A first step can entail altering 
the conformation of atoms defined in an electron density 

10 map. The conformations of the atoms can be altered by 

simulating a rise in temperature which will increase the 
vibrational frequency of the bonds and modify positions 
of atoms in the structure. At a particular point in the 
atomic perturbation process, a force field, which 

15 typically defines interactions between atoms in terms of 
allowed bond angles and bond lengths, Van der Waals 
interactions, hydrogen bonds, ionic interactions, and 
hydrophobic interactions, can be applied to the system 
of atoms. Favorable interactions may be described in 

20 terms of free energy and the 'atoms can be moved over 

many iterations until a free energy minimum is achieved. 
The refinement process can be iterated until the R- 
f actor reaches a minimum value. 

The three dimensional structure of the molecule or 

25 molecule complex is described by atoms that fit the 

theoretical electron density characterized by a minimum 
R- value. A file can then be created for the three 
dimensional structure that defines each atom by 
coordinates in three dimensions. Examples of such 

30 structural coordinate files are defined in Table 1, 
fable 2, Table 3, and Table 4. 
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7C 

5:r-r.ursa c,f FGFR1 

The present invent ion provides high- resolut ion 
three-dimensional structures and atomic structure 
coordinates of crystalline FGFR1 and crystalline 
FGFR1 : AMP-PCP co-complex as determined by X-ray 
crystallography. The specific methods used to obtain 
the structure coordinates are provided in the examples. 
The atomic structure coordinates of crystalline FGFR1 , 
obtained from the C2-A form of the crystal to 2.0 A 
resolution, are listed in Table 3; the coordinates of 
crystalline FGFR1 : AMP-PCP co-complex, obtained from the 
C2-A form of the crystal to 2 . 3 A resolution are listed 
in Table 4 . 

Those having skill in the art will recognize that 
atomic structure coordinates as determined by X-ray 
crystallography are not without error. Thus, it is to 
be understood that any set of structure coordinates 
obtained for crystals of FGFR1, whether native crystals, 
derivative crystals or co-crystals, that have a root 
mean square deviation ("r.m.s.d.") of less than or equal 
to about 1.5 A when superimposed, using backbone atoms 
(N, C a , C and 0) , on the structure coordinates listed in 
Table 3 or Table 4 are considered to be identical with 
the structure coordinates listed in the Tables when at 
least about 50% to 100% of the backbone atoms of FGFR1 
are included in the superposition. 

Referring now to FIG. 1, the overall structure of 
FGFR1 is bi-lobate. The N- terminal lobe of FGFR1 spans 
amino acid residues 456-567 (FIG. 3) and comprises a 
curled £- sheet of five anti-parallel strands ((Jl-05) and 
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one a-helix (aC) . The ^-terminal lobe spans ammo acid 
residues 563-765 (FIG. 3 and comprises :wo Q- strands 
(37, 33) and seven a -hell :es (aD, aE, aEF , atF-al) . The 
secondary structure nomenclature follows thac used for 
IRK (Hubbard et al . , 1994) which in cum is based on the 
assignments for cAPK (Knighton et al . , 1991). FIG. 2 
shows a stereo view of a C 3 "race of FGFR1 in the same 
orientation as FIG . 1. 

A structure-based sequence alignment of the 
tyrosine kinase domains of human fibroblast growth 
factor receptor 1 (human FGFRI; labelled FGFR1) , human 
fibroblast growth factor receptors 2, 3 and 4 (labelled 
FGFR2 , FGFR3 and FGFR4 , respectively), a D. melanogaster 
homologue (labelled DFDFR1) , a C elegans homologue 
(labelled EGL-15) and insulin receptor kinase (labelled 
IRK) , is shown in FIG. 3. The sequence of FGFRI, which 
is not shown in FIG. 3 is identical to the sequence of 
FGFRI except that FGFRI has the folic wing amino acid 
substitutions and additions: Cys-488 - Ala, Cys-584 - 
Ser, Leu-457 - Val and an additional five N-terminal 
amino acids (Ser-Ala-Ala-Gly-Thr) . The secondary 
structure assignments for FGFRI and IRK vere obtained 
using the Kabsch and Sander algorithm (Kabsch and 
Sander, 1983) as implemented in PROCHECK Laskowski et 
al., 1993). In the FGF receptor sequences- a period 
represents sequence identity to FGFRI. In the IRK 
sequence, residues that are identical to FGFRI are 
highlighted. A hyphen denotes an insertion. 

The numbers under the EGL-15 sequence represent the 
fractional solvent accessibility (FSA2) of the residue 
in the FGFRI structure. The FSA ratio is the ratio of 
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the solvent -accessible surface area- of a residue in a 
Gly-X-Gly cripepcide compared t 3 that in che FGFR1 
scruccure. A value of 0 repre encs an FSA between 0.00 
and 0.09; 1 represents an FSA oecween 0.10 and 0.19, 
etc. The higher che value, r .ie more solvent -exposed che 
residue. An asterisk or pou id sign in che FSA line 
indicaces that che residue ascensk) or side chain 
(pound sign) is not includ* d in che atom model due co 
disorder. The numbers bel :>w che FSA line are che FSAs 
for chose residues chat f ;rm part of a dimer interface. 

The amino acid resi ue numbers for FGFR1, and hence 
FGFR1, and IRK provided in FIG. 3 are used in the 
discussion that followr . Significant differences in the 
N- terminal lobe of FGF il as compared to IRK are found in 
the loops between 3 s -rands and in aC. Residues from the 
end of (51 through th^ beginning of (52 (amino acid 
residues 485-490) term the nucleotide-binding loop, 
named because of it 3 role in ATP coordination. This 
residue stretch cc.itains the protein kinase-conserved 
GXGXXG sequence motif, where X is any amino acid. This 
loop is poorly o: dered in one FGFR1 molecule in the 
asymmetric unit and disordered ( i.e. ■ not included in 
the atomic mode^) in the other FGFR1 molecule in the 
asymmetric uni ; . The loop between 31 and 33 is 
disordered in both FGFR1 molecules comprising the 
asymmetric ur.it. 

Referring now to FIG. 4A, which provides a ribbon 
diagram of the N- terminal lobes of FGFR1 and IRK in 
which the Z 9 atoms of the 3-sheets have been 
superimposed, it can be seen that in FGFR1 aC is longer 
by one h-lical turn than in IRK and is oriented such 
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that residues lys -514 and Glu-531, which are conserved 
in procein kinases, for^ a sail bridge (represented by a 
black line; . While not intending to be bound by theory, 
this salt bridge is believed to be important for proper 
5 positioning of the "conserved lysine side chain, which 
coordinates two phosphate oxygens of ATP. The salt 
bridge is observed in the structures of cAPK (Knighton 
et al., 1991) and mitogen-activated protein kinase 
(MAPK) (Zhang et al . , 1994). 

10 Referring now to FIG. 4B, which provides a ribbon 

diagram of the C- terminal lobes of FGFR1 and IRK in 
which the C a atoms of the a-helices have been 
superimposed, a significant difference is found in the 
C-terminal helix of FGFR1 when compared to IRK; helix a I 

15- of FGFR1 is longer by seven residues (two helical turns) 
•than its counterpart in IRK. The extended length of al 
is presumably important in the biological functioning of 
FGF receptors, since the tyrosine autophosphorylation 
site to which an SH2 domain of PLCy binds is six 

2 0 residues C-terminal to this helix. 

The structure of FGFR1 displays an open disposition 
of the N- and C-terminal lobes. Despite having 
different sets of lattice contacts, the two FGFR1 
molecules in the asymmetric unit have only a 2° 

25 difference in relative lobe orientation. It appears as 
though the stearic interaction between residues in aC 
(Glu-531 and Met-534) with Phe-642 and Gly-643 of the 
protein kinase -conserved DFG sequence at the beginning 
of the activation loop accounts for the open 

30 • conformation of FGFR1 . 

The active site of FGFR1 is characterized by at 
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lease amine acid residues spanning the catalytic loop, 
activation loop and nucleotide binding loop. Unlike the 
structure of IRK, m which Tyr- 116 2 occupies the active 
site of the rr.olecuie, the active sites of both FGFR1 
molecules in the asymmetric unit are unoccupied. 

The activation loop, which regulates 
phosphorylation, is characterized by at least resides 
640 to 663. Quite surprisingly, while the activation 
loops of FGFR1 and IRK contain the same number of ammo 
acid residues and share greater than 50% sequence 
homology, the paths of the polypeptide chains are 
strikingly dissimilar, diverging at Ala-640 (Gly-1149 in 
IRK) and reconverging at Val-664 (Val-1173 in IRK) . 
Tyr-653 and Tyr 564 are not bound, in the active site. 
Instead, these residues point away from it. Tyr-653 is 
in van der Waals contact with several hydrophobic 
residues (Val-664, Leu-672 and Phe-710) and is hydrogen- 
bonded via its hydroxyl group to a bacicbone carbonyl 
oxygen (Leu-672) . Tyr-654 is more solvent exposed than 
0 Tyr-653, and its only van der Waals contact is with Val- 
706. Temperature factor data suggest. that the 
activation loop is relatively mobile and adopts multiple 
conformations. 

The catalytic loop of protein kinases lies between 
5 secondary structure elements aE and (S7 and contains an 

invariant aspartic acid residue (Asp-623 lir FGFR1) which 
serves as the catalytic base in the phosphotransf er 
reaction, abstracting the proton from the hydroxyl group 
of the substrate tyrosine, serine or threonine. The 
0 catalytic loop sequence of FGFR1 comprises at least 
residues His-621 to Asn-628 (amino acid sequence 
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HRDLAARN) , and is identical to that for IRK and xos: 
receptor and non-receptor ?TKs . 

In addition to the two tyrosine autophosphorylat ion 
sites in the activation loop (Tyr - 6 53 and Tyr-554;. 
5 there are four other autophosphorylat ion sites present 
in the FGFR1 crystals of the invention: one in the 
juxtamembrane region (Tyr-463), two in the kinase insert 
(Tyr-583 and Tyr-585) and one in the C-terminal lobe 
(Tyr-730) (Mohammadi et al . , 1996). They exhibit 
10 varying degrees of conservation in mammalian FGF 

receptors: Tyr-463 and Tyr-585 in FGFR1 and 2; Tyr-583 
in FGFR1 , 2 and 3; and Tyr-73 0 in FGFR 1, 2, 3 and 4 
(FIG. 3) . 

•Referring now to FIG. 5, the positions of the 

15 autophosphorylat ion sites are mapped onto the FGFR1 
structure. The juxtamembrane site (Tyr-463) and the 
residues N- terminal to it are disordered in one of the 
FGFR1 molecules in the asymmetric unit . In the other 
molecule in the asymmetric unit Tyr-4 63 is involved in a 

20 lattice contact. 

The kinase insert region (the region between 
helices otD and aE) contains autophosphorylat ion sites 
Tyr-583 and Tyr-585 and is disordered in both FGFR1 
molecules in the asymmetric unit of the C2-A form of the 

25 crystal. In the C2-B form, several lattice contacts 

partially pin down this region in one of the two FGFR1 
molecules in the asymmetric unit, allowing a trace of 
the polypeptide chain to be made. There is no well- 
defined secondary structure for these residues. Tyr- 

3 0 730, situated in oH in the C- terminal lobe, is nearly 
buried and the side-chain hydroxyl group makes two 
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hydrogen - bonds . The side chains of neighboring Met -73 2 
and Met-733 are both buried. Therefore, phosphorylation 
of Tyr-730 would presumably require prior unfolding of 
aH . 

5 Aside from Tyr-730, the five other 

autophosphoryiation sites (including Tyr-653 and Tyr- 
654) are found in relatively mobile segments of the 
FGFR1 molecule. While not intending to be bound by 
theory, the spatial positions of the autophosphoryiation 
10 sites relative to the active site suggest that 

autophosphoryiation occurs by a crans mechanism between 
two kinase domains, supporting the hypothesis that 
iigand- induced receptor dimerization is critical for the 
initiation of autophosphoryiation events. 
15 The structure of crystalline FGFR1 : AMP-PCP co- 

complex is essentially similar to that observed for 
crystalline FGFR1 . There are no significant changes in 
the structure of FGFR1 induced by AMP-PCP binding. In 
particular, binding of AMP-PCP, and by extension ATP, 
20 does not by itself promote lobe closure under the 
crystallization conditions used. Furthermore, 
complexation did not result in any noticeable changes in 
the conformations of the activation and nucleotide- 
binding loops. 

25 The crystalline FGFR1 : AMP-PCP co-complex contains 

hydrogen bonds that are present between Nl of adenine 
and the amide nitrogen of Ala- 564 and between N6 of 
adenine and the carbonyl oxygen of Glu-562. The adenine 
ring is flanked on one side by Leu-484 and Val-492 (N- 

3 0 terminal lobe) and on the other side by Leu-630 

(C- terminal lobe) . The ribose hydroxy 1 groups make no 
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direct hydrogen bonds wizh protein atoms. Lys-514 is 
hydrogen-bonded to oxygens of the 3- ana v-phosphaces . 
There is no unambiguous electron density that would 
indicate the positions of Mg 2 * ions. Generally, AM? - PC? 
appears to be coordinated rather loosely to 
unphosphory laced FGFRl, being bound co the "roof" of the 
cleft rather than being tightly sandwiched between the 
two kinase lobes . 



Structural Different.; Rgtwpon p g f-R 

Several features distinguish the FGF- receptor 
structure from that of the insulin-receptor tyrosine 
kinase. These distinctions are likely to be important 
in signaling by FGF- receptors, and other monomeric 
receptors that are believed to undergo ligand- induced 
dimerization. 

The most significant difference between the 
structures of FGFRl and IRK is the conformation of the 
activation loop. In FGFRl, the activation loop is 
disposed such that the binding site for substrate 
peptides is blocked not by an activation loop tyrosine, 
as in IRK, but by Arg-661 and PTK- invariant Pro-663, 
while the ATP binding site is accessible. This 
represents another molecular mechanism by which a 
receptor PTK may be autoinhibited. The observed 
autoinhibition in FGFRl would appear to be weaker than 
that in IRK because of fewer specific interactions made 
by residues in the FGFRl activation loop (manifested in 
the relatively higher B-values) and the accessibility of 
the ATP site. One obvious distinction between the 
insulin and FGF receptor families is that in the former, 
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receptors are covaier.tiy linked heterotetramers (a 2 3 2 ) , 
whereas in the latter, receptor dimerization ■ is iigand 
dependent. Receptors whose kinase domains are always in 
close proximity nay require a stronger automhibit ion 
5 mechanism than those receptors that associate only upon 
ligand binding (Taylor et al . , 1995). Since most growth 
factor receptors undergo ligand-dependent dimenzation 
and activation, the FGF receptor autoinhibit ion 
mechanism appears to be a more general one. 

10 

VI . nags of rh* Crysta ls *nd Atomic Structure 
Coordinates 

The crystals of the invention, and particularly the 
atomic structure coordinates obtained therefrom, have a 
15 wide variety of uses. For example, the crystals 

described herein can be used as a starting material in 
any of the art -known methods of use for receptor and 
non-receptor tyrosine kinases. Such methods of use 
include, for example, identifying molecules that bind to 
20 the native or mutated catalytic domain of tyrosine 

kinases. The crystals and structure coordinates are 
particularly useful for identifying compounds that 
inhibit receptor and non- receptor tyrosine kinases as an 
approach towards developing new therapeutic agents (sfifi, 
25 e.g. . Levitzki and Gazit, 1995) . 

The structure coordinates described herein can be 
used as phasing models for determining the crystal 
structures of additional native or mutated tyrosine 
kinase domains, as well as the structures of co-crystals 
3 0 of such domains with ligands such as inhibitors, 
agonists, antagonists, and other molecules. The 
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str-ccure ::cri:na:es , as well as ~ ode Is of the three - 
dir.er.s icnai structures cbcsmed cheref rorr. . can also be 
usee :: aid the elucidation of soluc ion -based structures 
of native or rr.utated tyrosine kinase domains, sue r. as 
5 those obtained via MMR . Thus, the crystals and atonic 
structure coordinates of the invention provide a 
convenient means for elucidating the structures and 
functions of receptor and non-receptor tyrosine kinases. 
For purposes of clarity and discussion, the 

10 crystals of the invention will be described by reference 
to specific FGFR1 exemplary crystals. Those skilled in 
the art will appreciate that the principles described 
herein are generally applicable to crystals of the 
tyrosine kinase domain of any cytoplasmic tyrosine 

15 kinase that undergoes ligand- induced dimerization or 

-receptor tyrosine kinase, including but not limited to 
the tyrosine kinases of FIG . 6. 

VII. Structure Determination for PTKs with Unknown 

2 0 Structure Using Structural Coordinates . 

Structural coordinates, such as those set forth in 

Table 1, Table 2, Table 3, and Table 4, can be used to 
determine the three dimensional structures of PTKs with 
unknown structure. The methods described below can 
25 apply structural coordinates of a polypeptide with known 
structure to another data set, such as an amino acid 
sequence, X-ray crystallographic diffraction data, or 
nuclear magnetic resonance (NMR) data. Preferred 
embodiments of the invention relate to determining the 

3 0 three dimensional structures of PTKs and related 

polypeptides. These include receptor PTKs such as FGF- 
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3 0 

?., PDGF-R, ICR, CCK4 , MET , TRKA, AX1, TIE , E?H, RYK , 
DDR , RCS, RET, LTK, ROR1 , and MUSK . Non- receptor ?TKs 
such as SRC , 3RK , 3TK , CSK, ABL, ZAP70 , FES ( FAK, JAK , 
and ACK can also be used m the methods described 
herein . 

Structures Using Amino Acid Homology 

Homology modeling is a method of applying 
structural coordinates of a polypeptide of known 
structure to the amino acid sequence of a polypeptide of 
unknown structure. This method is accomplished using a 
computer representation of the three dimensional 
structure of a polypeptide or polypeptide complex, the 
computer representation of amino acid sequences of the 
polypeptides with known and unknown structures, and 
standard computer representations of the structures of 
amino acids. Homology modeling comprises the steps of 
(a) aligning the amino acid sequences of the 
polypeptides with and without known structure; (b) 
transferring the coordinates of the conserved' amino 
acids in the known structure to the corresponding ammo 
acids of the polypeptide of unknown structure; refining 
the subsequent three dimensional structure; and (d) 
constructing structures of the rest of the polypeptide. 
One skilled in the art recognizes that conserved amino 
acids between two proteins can be determined from the 
sequence alignment step in step (a) . 

The above method is well known to those skilled in 
the art. Greer, 1985, Science 228, 1055. Blundell et 
al., 1988, Eur. J. Biochem. 172, 513. A computer 
program currently utilized for homology modeling by 
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tr.ose skilled ir. ::.e ar: is the Homology module m 
Insight II modeling package distributed by Molecular 
5 imuiat ions Inc. 

Alignment of -he amino acid sequence is 
accomplished by first: placing the computer 
representation of the amino acid sequence of a 
polypeptide with known structure above the amino acid 
sequence of the polypeptide of unknown structure. Amino 
acids in the sequences are then compared and groups of 
amino acids that are homologous (e.g., amino acid side 
chains that are similar in chemical nature - aliphatic, 
aromatic, polar, or charged) are grouped together. This 
method will detect conserved regions of the polypeptides 
and account for amino acid insertions or deletions. 

Once the amino acid sequences of the polypeptides 
with known and unknown structures are aligned, the 
structures of the conserved amino acids in the computer 
representation of the polypeptide with known structure 
are transferred to the corresponding amino acids of the 
polypeptide whose structure is unknown. For example, a 
tyrosine in the amino acid sequence of known structure 
may be replaced by a phenylalanine, the corresponding 
homologous amino acid in the amino acid sequence of 
unknown structure. 

The structures of amino acids located in non- 
conserved regions are to be assigned manually by either 
using standard peptide geometries or molecular 
simulation techniques, such as molecular dynamics. The 
final step in the process is accomplished by refining 
the entire structure using molecular dynamics and/or 
energy minimization. 
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The homology modeling method is well known :o those 
killed in -he ar: and has been practiced using 
different crotein molecules. The three dimensional 
structure of the polypeptide corresponding to the 
catalytic domain of a serine/threonine protein kinase, 
myosin light chain protein kinase, was homology modeled 
from the cAMP- dependent protein kinase catalytic 
subunit. Knighton et al . , 1992, Science 258:130-135. 

.Structures U sing Molecular Replacement 

Molecular replacement is a method of applying the 
X-ray diffraction data of a polypeptide of known 
structure to the X-ray diffraction data of a polypeptide 
of unknown sequence. This method can be utilized to 
define the phases describing the X-ray diffraction data 
of a polypeptide of unknown structure when only the 
amplitudes are known. X-PLOR is a commonly utilized 
computer software package used for molecular 
replacement. Brunger, 1992, Nature 355:472-475. AMORE 
0 is another program used for molecular replacement. 
Navaza, 1994, Acta Crystallogr. A50: 157-163. 
Preferably, the resulting structure does not exhibit a 
root -mean- square deviation of more than 3 A. 

A goal of molecular replacement is to align the 
5 positions of atoms in the unit cell by matching electron 
diffraction data from two crystals. A program such as 
X-PLOR can involve four steps. A first step can be to 
determine the number of molecules in the unit cell and 
define the angles between them. A second step can 
0 involve rotating the diffraction data to define the 

orientation of the molecules in the unit cell. A third 
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step car. be "rar.siace che eiec^n .30--;- - 

dimensions to <-o*->-o/~r : •> --<-.-.__ _ „ „ 

_.„s on ._ne moiecuies in - 

ur.it cel.. once the amplitudes and phases of the X-ray 
-:::racuon data is determined, an '-factor can be 
calculated by comparing electron diffraction naps 
calculated experimentally from the reference data set 
and calculated from the new data set. An R-f actor 
between 30-50% indicates that the orientations of the 
atoms in the unit ceil are reasonably determined by this 
method. A fourth step in the process can be to decrease 
the R-factor to roughly 20% by refining the new electron 
density map using iterative refinement techniques 
described herein and known to those or ordinary skill in 
the art . 



Structures rising mmr n al -fl 

Structural coordinates of a polypeptide or 
polypeptide complex derived from X-ray crystallographic 
techniques can be applied towards the elucidation of 
three dimensional structures of polypeptides from 
nuclear magnetic resonance (NMR) data. This method is 
used by those skilled in the art. Wuthrich, 1986, John 
Wiley and Sons, New York: 176-199; pflugrath et al., 
1986, J. Molecular Biology 299:383-386; Kline ec al., 
1986, J. Molecular Biology 235:377-382. While the 
secondary structure of a polypeptide is often readily 
determined by utilizing two-dimensional NMR data, the 
spatial connections between individual pieces of 
secondary structure are not as readily determinable. 
The coordinates defining a three-dimensional structure 
of a polypeptide derived from X-ray crystallographic 
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techniques can guide the NMR spect roscopist to an 
understanding of these spatial interactions between 
secondary structural elements in a polypeptide of 
related structure. 

The knowledge of spatial interactions between 
secondary structural elements can greatly simplify 
Nuclear Overhauser Effect (NOE) data from two- 
dimensional NMR experiments. Additionally, applying the 
crystallographic coordinates after the determination of 
secondary structure by NMR techniques only simplifies 
the assignment of NOEs relating to particular amino 
acids in the polypeptide sequence and does not greatly 
bias the NMR analysis of polypeptide structure. 
Conversely, using the crystallographic coordinates to 
simplify NOE data while determining secondary structure 
of the polypeptide would bias the NMR analysis of 
protein structure. 

As the analysis of polypeptide structure by NMR 
methods is a relatively new technique, the use of 
structural coordinates defining a PTK structure will 
most likely be utilized more frequently in the near 
future. As the method progresses, the three dimensional 
structure analysis of polypeptides of the same size as a 
PTK catalytic domain will become more frequent. 

VIII. gtmcture -Baaed Design of Modulators Of PTK 

Function Utilizing St ructural Coordinates 
Structure-based, modulator design and identification 

methods are powerful techniques that can involve 

searches of computer data bases containing a wide 

variety of potential modulators and chemical functional 
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crcups . The computerized design and identification z i 
T.cdulators is useful as "he computer data oases contain 
~::e co-pounds than the chemical libraries, often bv an 
order of magnitude. For reviews of structure-based drug 
5 design and identification see Kuntz e c al . I 1994, Acc. 
Chen, Res. 27:117; Guida, 1994, Current Opinion in 
Struc. 3iol. 4: 111; Colman, 1994, Current Opinion in 
Szruc. 3iol. 4: 368. 

The three dimensional structure of a polypeptide 

10 defined by structural coordinates can be utilized by 
these design methods. The structural coordinates of 
Table 1, Table 2, Table 3, and Table 4 can be utilized 
by this method. In addition, the three dimensional 
structures of receptor and non-receptor PTKs determined 

15 by the homology, molecular replacement, and NMR 

techniques described herein can also be applied to 
modulator design and identification methods. Thus, the 
structures of receptor PTKs, FGF-R, PDGF-R, FLK, CCK4 , 
MET, TRKA, AXL, TIE, EPH, RYK, DDR, ROS, RET, LTK, ROR1,. 

20 and MUSK, can be utilized by the methods described 

herein. The structures of non-receptor PTKs, SRC, BRK, 
BTK, CSK, ABL, ZAP70, FES, FAK, JAK, and ACK, can also 
be utilized by the rational modulator design method. 

25 Design by Searching Molecular Data Bases 

One method of rational modulator design searches 
for modulators by docking the computer representation of 
compounds from a data base of molecules. Publicly 
available data bases include: 

30 

a) ACD from Molecular Designs Limited 
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36 

b) NCI from National Cancer Institute 

c) CCDC from Cane ridge Crystal iographic Data Center 
d i CAST from Chemical Abstract Service 

e) Derwent from Derwent Information Limited 

f) Maybridge from Maybridge Chemical Company LTD 

g) Aldrich from Aldrich Chemical Company 

h) Directory of Natural Products from Chapman & Hall 

One such data base (ACD distributed by Molecular Designs 
Limited Information Systems) contains, for example, 
200,000 compounds that are synthetically derived or are 
natural products. Methods available to those skilled in 
the art can convert a data set represented in two 
dimensions to one represented in three dimensions. 
These methods are enabled by such computer programs as 
CONCORD from Tripos Associates or DB-Converter from 
Molecular Simulations Limited. 

Multiple methods of structure-based modulator 
design are known to those in the art. Kuntz et al . , 
1982, J. Mol. Biol. 162i 269; Kuntz et al . , 1994, 
Acc. Chem. Res. 27: 117; Meng et al., 1992, J. Compt. 
Chem. 13: 505; Bohm, 1994, J. Comp. Aided Molec. Design 
8: 623. 

A computer program widely utilized by those skilled 
in the art of rational modulator design is DOCK from the 
University of California in San Francisco. The general 
methods utilized by this computer program and programs 
like it are described in three applications below. More 
detailed information regarding some of these techniques 
can be found in the Molecular Simulations User Guide, 
1995. 
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A typical computer program used for this purpose 
car. co-prise the following seeps : 

a; remove the existing compound from -he protein,- 
ib> dock -he structure of another compound mcc 
.the active-site using the computer program (such as 
DOCK) cr by interactively moving the compound into the 
active-site; 

(c) characterize the space between the compound 
and the active -site atoms; 

(d) search libraries for molecular fragments which 
(i)can fit into the empty space between the compound and 
the active-site, and (ii) can be linked to the compound; 
and 

(e) link the fragments found above to the compound 
and evaluate the new modified compound. 

Part (c) refers to characterizing the geometry and 
the complementary interactions formed between the atoms 
of the active-site and the compounds. A favorable 
geometric fit is attained when a significant surface 
area is shared between the compound and active -site 
atoms without forming unfavorable steric interactions. 

One skilled in the art would note that the method 
can be performed by skipping parts (d) and (e) and . 
screening a data base of many compounds. 

Structure -based design and identification of 
modulators of PTK function can be used in conjunction 
with assay screening. As large computer data base of 
compounds (around 10,000 compounds) can be searched in a 
matter of hours, the computer based method can narrow 
the compounds tested as potential modulators of PTK 
function in cellular assays. 
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The above descriptions of s:ructure-based ncdula:cr 
design are not all encompassing and other methods are 
recorded in the literature: 

(1) CAVEAT: 3artlett et al., 1939, in "Chemical and 
Biological Problems in Molecular Recognition"; Roberts, 
3.M.; Ley, S.V.; Campbell, M.M. eds . ; Royal Society of 
Chemistry: Cambridge, ppl82-196. 

(2) FLOG: Miller et al., 1994, J. Comp. Aided 
Molec. Design 5:153. 

(3) PRO Modulator: Clark et al . , 1995, J . Comp. 

Aided Molec. Design 9:13. 

■ (4) MCSS: Miranker and Karplus, 1991, Proteins: 
Structure, Function, and Genetics 11:29. 

(5) AUTODOCK: Goodsell and Olson, 1990, Proteins: 
Structure , Function, and Genetics 8:195. 

(6) GRID: Goodford, 1985, J. Med. Chem. 28:349. 

pgfiiem bv flrvH fyina Compounds in romPlex with PTKS 

Another way of identifying compounds as potential 

modulators is to modify an existing modulator in the 

polypeptide active-site. For example, the computer 

representation of modulators can be modified within the 

computer representation of a PTK active-site. Detailed 

instructions for this technique can be found in the 

Molecular Simulations User Manual, 1995 in LUDI . The 

computer representation of the modulator is modified by 

the deletion of a chemical group or groups or by the 

addition of a chemical group or groups. 

Upon each modification to the compound, the atoms 

of the modified compound and active-site can be shifted 

in conformation and the distance between the modulator 
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representations of compounds from a computer data case 
that have similar three dimensional structures and 
similar chemical groups as compound bound to a ?TK . 

A typical program can operate by using the 
following general steps: 

(a) map the compounds by chemical features such as 
by hydrogen bond donors or acceptors, 
hydrophobic/lipophilic sites, positively ionizable 
sites, or negatively ionizable sites; 

(b) add geometric constraints to the mapped 
features; and 

(c) search data bases with the model generated in 

(b) . 

Those skilled in the art recognize that for 
indolinones, the important chemical features include, 
but are not limited to, a hydrogen bond donor, a 
hydrogen bond acceptor, and two hydrophobic points of 
contact. Those skilled in the art also recognize that 
not all of the possible chemical features of the 
compound need be present in the model of (b) . One can 
use any subset of the model to generate different models 
for data base searches. 

IX. Qyganig Synthetic Techniques 

The versatility of computer-based modulator design 
and identification lies in the diversity of structures 
screened by the computer programs. The computer 
programs can search data bases that contain 200,000 
molecules and can modify modulators already complexed 
with the enzyme with a wide variety of chemical 
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rur.ct lor.ai groups. A consequence cf "his chemical 
diversity is that a po-sr.rial modulator of ?TX function 
may take a chemical form that is net predictable. A 
wide array of organic synthetic techniques exist in the 
5 art. to meet the challenge of constructing these 

potential modulators of ?TK function. Many of these 
organic synthetic methods are described in detail in 
standard reference sources utilized by those skilled in 
the art. One example of such a reference is March, 

1° ■ 1994, Advanced Organic Chemistry; React ions. Mgrhanisms. 
and Structure. New York, McGraw Hill. Thus, the 
techniques required to synthesize a potential modulator^ 
of ?TK function identified by computer-based methods are 
readily available to those skilled in the art of organic 

15 chemical synthesis. 

X. Cellular Assays Measuring the Effe ct of a PTK 
Modulator in Signal Transduc tion Pathways 

20 Cellular assays can be used to test the activity of 

a potential modulator of PTK function as well as 
diagnose a disease associated with inappropriate PTK 
activity. A potential modulator of PTK function can be 
tested for activity in vitro by assays that measure the 

25 effect of a potential modulator on the 

autophosphorylation of a particular PTK over-expressed 
in a cell line. Thus, a modulator that acts as a potent 
inhibitor of the catalytic domain corresponding to a PTK 
would decrease the amount of autophosphorylation 

3 0 catalyzed by that PTK. Potential modulators could also 
be tested for activity in cell growth assays in vitro as 
well as in animal model assays in vivo. 
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In vivo assays are also useful ' for :es:;r.g the 
b:cacciv::y of a potential modulator designed by -he 
^ezhods of the invention. 

Materials, methods, and experimental data for these 
5 assays are fully described in WO 96/40116 published on 
December 19, 1996, entitled "Indolmone Compounds for 
the Treatment of Disease" . This application is 
incorporated herein by reference in its entirety, 
including all drawings, figures, and tables. 

10 

XI. Administration of Modulators of PTK Function as 
Therapeutics for Disease 

Methods of administering compounds to organisms as 
therapeutics for disease are fully described in WO 
15 96/40116 published on December 19, 1996, entitled 

"Indolinone Compounds for the Treatment of Disease". 
This application is incorporated herein by reference in 
its entirety, including all drawings, figures, and 
tables. 

20 

EXAMPLES 

The examples below are non- limiting and are merely 
representative of various aspects and features of the 
present invention. The examples provide illustrative 

25 methods for obtaining crystalline forms of protein 
kinase polypeptides, methods for determining three 
dimensional structures of these protein kinase 
polypeptides, and methods for identifying modulators of 
protein kinases using the three dimensional structures 

30 of the protein kinases. 
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A recombinant bacuiovirus was engineered c 



residues 456-755 of human FGFRl . A cleavable N-cer--a 
histidme rag was incorporated to aid ia orotein 
purification. Three amino acid substitutions wer- 
introduced: Cys-488 to Ala, Cys-584 to Ser and L eu - 4 37 
to Val. The two cysteine substitutions were made to 
prevent the formation of disulfide- linked oligomers 
which occurs for the native protein. The substitution 
Leu-457 to val introduced a »coI cloning site near Met- 
456. The codon for Tyr-766 (TAC) was changed to a stop 
codon (TAG) and a « a dTIl-cloni„g site was generated 
following this stop codon. These substitutions were 
introduced into the full length human cDNA of FGF^ in 
ml3MPI9 by site-directed mutagenesis according to the 
manufacturer's protocol (Amersham) . 

The resulting construct was digested with Ncol and 
Hmdlli and was ligated into appropriately digested 
pBlueflac HistagB (Invitrogen) . Transfection of insect 
cells ( S f9) was performed with the BaculoGold 
transfection system according to the manufacturer's 
protocol (Pharmingen) . Following identification of 
positive plaques, the recombinant bacuiovirus was 
amplified to high titer (5x10' virus particles/ml, . S f9 
cells were grown in 175-cm* flasks to a density of 2- 
3x10' per flask and infected with recombinant bacuiovirus 
with a multiplicity of infection (MOD of 10. 

After 48 hr, cells were harvested by centrifugation 
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a: 3,0Q0g for 3 5 T.:r. a: and then iysed in 25 mM 

HE?ES (pH 7.5), 150 mM NaCl , 10% glycerol, 1.5 mM MgCl 2 , 
1 % Tr:::n X-100, 10 ug/ml aproconin, 10 ug/ml 
ieupeptin, and 1 mM pher.ylmethylsul f onyl fluoride 
(PMSF). Lysaces were centrifuged in a Sorval RC 5C 
(Dupont) for 1 hr at 4°C at 40,000g followed by 
ultracentrif ugacion in an XL-80 (Beckman) at 100,G00g 
for 1 hr. After centrif ugation, the clarified iysate 
was passed over a Ni 2 * -chelating column (Pharmacia) , and 
the bound hist idine- tagged fusion protein was eluted 
with 100 mM imidazole (pH 7.5). Pooled fractions were 
loaded onto a Mono Q anion exchange column (Pharmacia) 
and eluted with a NaCl gradient from 0 to 500 mM. 

The fractions containing the fusion protein were 
concentrated in a Centricon-30 (Amicon) , and the 
histidine tag was removed by overnight digestion with 
enterokinase (Biozyme) at 20°C. The digestion was 
terminated by the addition of aprotonin, leupeptin, 
PMSF, TPCK, and bovine pancreatic trypsin inhibitor 
(BPTI) . The cleaved kinase domain was then separated 
from the histidine tag on a Superose 12 size-exclusion 
column (Pharmacia) . The eluted kinase domain was 
further purified on a Mono Q column. The purified 
kinase domain was analyzed by N- terminal sequencing and 
mass spectrometry. Five amino acids (SAAGT) remained 
from the histidine tag. The predicted molecular mass 
was confirmed by mass spectrometry. 

Crystal Growth 

Purified FGFR1 was concentrated to 20-50 mg/ml and 
exchanged into 10 mM Tris-HCl (pH 8.0), 10 mM NaCl, and 
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2 xM DTT using a Centricon- 30 . Crystals were crown at 
4°C by vapor diffusion in hanging drops containing 2.0 
— 1'- T.g/mi protein solution and 2.0 al of reservoir 
solution: 15% polyethylene glycol { PEG) 10000, 0.3 M 
(N"H ( ! ; SO., 5% ethylene glycol, and 100 mM bis-Tris ipH 
6.5). 

Crystals of native FGFR1 were soaked in 500 nil 
stabilizing solution [25% PEG 10000, 0.3 M (NH4) 2 S0 4 , 0.1 
M 3is-Tris {pH 6.5), 5% ethylene glycol] containing 3- 
[ (3- ( 2-carboxyethyl) -4 -methyipyrrol - 5 -yl ) methylene] -2- 
indoiinone (1-5 mM) or 3- [4- (4-formylpiperazine-i-yl) - 
benzylidenyl] -2 - indolinone (i mM) at 4°C for 24 to 48 
hours. The final soaking concentration of DMSO was 
between 1 to 5%. The crystals cracked at higher 
concentrations of DMSO. 

Co-crystals of FGFR1 with the inhibitors could also 
be obtained by vapor diffusion in hanging drops 
containing 2.0 /il of 10 mg/ml protein solution and 2.0 
Ml of reservoir solution containing 1 mM 3-[(3-<2- 
carboxyethyl) -4 -methyipyrrol- 5 -yl) methylene] -2- 
indolinone and 3- [4- (4-formylpiperazine-l-yl- 
) benzylidenyl] -2 -indolinone . 

Co-crystals of FGFR1 complexed with AMP-PCP were 
obtained as described for the creation of native 
crystals, except that the protein solution additionally 
contained 10 mM AMP-PCP and 20 mM MgCl 2 . 

Preparation Of Heaw Atom Derivative Crystals 

Heavy atom derivative crystals were obtained by 
soaking FGFR1 native crystals (C2-A form) in a solution 
containing ethylmercurithiosalicylic acid (thimerosal) ., 
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:<Au(CN*) 2 cr 4 -chlorcmercuribenzcic acid, as provided in 
Table 1, infra,, and containing 25% PEG 100CO. 0 . 3M 
..MH,.,33 4 , 5% et.-.viene glycol or glycerol, and ICC x.M 
bis-Tris (pK 5.5;, and were flash-cooled either :r. 
liquid nitrogen directly (Synchrotron) or in a dry 
nitrogen stream at -175°C (rotating anode) . 

^ra mil at i nr. a r H ^^'fture Determination 

For native crystals and crystals comprising the 
nucleotide analog AMP-PCP, data were collected either on 
a Rigaku RU-200 rotating anode operated at 50 kV and 100 
mA (Cu Ka) and equipped with double- focusing mirrors and 
an R-AXIS IIC image plate detector, or at beamline X-4A 
at the National Synchrotron Light Source, Brookhaven 
National Laboratory. Synchrotron data (X-1.07A) were 
collected on Fuji image plates and read with a Fuji 
scanner. One cryo-cooled crystal was used for each of 
the data sets. To obtain cryo-cooled crystals, crystals 
were soaked in a cryo-protectant solution containing 25% 
PEG 10000, 0.3 M (NH 4 ),SO,, 5* ethylene glycol or 
glycerol and 100 mM bis-Tris (pH 6.5), and were flash- 
cooled either in liquid nitrogen directly (synchrotron 
data) or in a dry nitrogen stream at -17 5°C (rotating 
anode data) . All data were processed using DENZO and 
SCALEPACK. Otwinowski, 1993, "Oscillation data 
reduction program," Proceedings of the CCP4 Study 
weekend, Sawyer et al., eds . (Daresbury, United Kingdom: 
SERC Daresbury Laboratory), 56-62. 

For native crystals and crystals comprising the 
nucleotide analog AMP-PCP, a molecular replacement 
solution was found initially for the C2-B crystal form 
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using an IRK search u.ode 1 that consisted of pciyaianme 
with the common side chains for residues 993-1263 { FGFRI 
residues 475-754), excluding residues 1094-1105 (kinase 
insert) and 1153-1170 (activation loop). With AiMORE 
5 (Navaza, 1994, AmoRe : an automated package for molecular 
replacement," Acta Cryscallogr. A50: 157-163), using 30% 
of the structure factor amplitudes between 15.0 and 3.5 
A, one of the two molecules in the asymmetric unit was 
located. The correlation coefficient (c.c.) for the 

10 correct 1-moiecule solution was 0.23 (versus 0.20 for 
the highest incorrect solution) . This molecule was 
rigid body-refined in X-PLOR (3riinger, 1992, X-PLOR 
(Version 3.1) Manual (New Haven, Conneticut: The Howeard 
Hughes Medircal -Institute and Department of Molecular 

15 Biophysics and Biochemistry, Yale Uiversity) ) , first as 
one rigid body unit, then as two units each comprising a 
lobe of the kinase. Rigid body refinement (12.0-3.5 A, 
F>3a) resulted in a relative rotation of the twc lobes 
of -10° and an increase of the c.c. from 0.20 to 0.25. 

2 0 The rigid body-refined molecule was then used as a new 
search model in AMORE, and this time both molecules in 
the asymmetric unit were located. The c.c. for the 
correct 2 -molecule solution was 0.3 5 (versus 0.27 for 
the highest incorrect solution) . 

25 Multiple cycles of model building and refinement 

against 6.0-2.4 A data resulted in the addition to the 
model of many of the side chains and some of the missing 
polypeptide chain. Model building was performed using 
T0M/FR0D0 (Jones, 1985, "Diffraction methods for 

30 biological macromolecules . Interactive computer 

graphics: FRODO," Methods in Enzymology 115: 157-171) 
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and conjugate-gradient minimization and simulated 
annealing were performed using X-PLCR. Brunger, supra. 
At this s cage , the R-value was 30% (free R-value of 
36%). To help expedite node 1 building and refinement, 
experimental phases were obtained. Because crystals 
grown in the presence of ethylene glycol were easier to 
manipulate than those grown in glycerol, several heavy- 
atom derivative data sets were collected from C2-A 
crystals that had been soaked in various heavy atom 
solutions. The C2-3 structure was subsequently refined 
against 6.0-2.4 A data to an R-value of 23.8% (free R- 
vaiue of 30.4%) with r.m.s.d. values of 0.008 A for bond 
distances and 1.4° for bond angles. 

Molecular replacement was used to locate the two 
FGFR1 molecules (designated FLGK-A and FLGK-B) in the 
'asymmetric unit of the C2-A crystal form. Using AMORE 
with 80% of structure factor amplitudes between 15.0 and 
3.5 A and the C2-B model, the c.c. for the correct 2- 
molecule solution was 0.62 (versus 0.35 for the highest 
0 incorrect solution) . Heavy atom positions were 

determined from difference Fourier maps using the 
calculated phases from the partial model. Refinement of 
heavy atom parameters and phase determination were 
performed with MLPHARE (Otwinowski , 1991, "Maximum 
5 likelihood refinement of heavy atom parameters," 

Isomorphous replacement and anomolous Ssattering r Evans 
and Leslie eds. (Darsbury, United Kingdom: SERC 
Daresbury Laboratory), 56-62)). An initial molecular 
isomorphous replacement (MIR) -phased electron density 
0 map was calculated with data between 2.0. and 2.8 A 
resolution. This map was improved by solvent 
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flattening, histogram matching and « 0 -— -r - , 
symmetry (NCS) averaging using DM (Cowcan. 1994, 
"Protein Crystallography," CCP4 and SSF-SAC3M Newsleczer 
tjoinz) 31 34 -38) . 

Refinement of the C2-A FGFR1 structure against s.0- 
2.0 A data proceeded by conjugate-gradient minimization 
and simulated annealing using X-PLOR. Tight NCS 
restraints were imposed until data to 2 . 0 A resolution 
were included in the refinement, at which point the 
restraints were lifted. An overall anisotropic B-value 
was calculated using X-PLOR and applied to the observed 
structure factors, reducing the R-value by -3%. Water 
molecules whose 3-values refined to ,70 A 2 were omitted 
from the subsequent refinement round. The average B- 
value is 37.5 A 2 for all protein atoms, 35.4 A 2 for 
protein, atoms in FLGK-A, 39.7 A 2 for protein atoms in 
FLGK-B. and 40.2 A 2 for water molecules. The side chains 
for Cys-603 in FLGK-A and FLGK-B and for Met-534 in 
FLGK-B have been modeled in two different conformations. 
Residues that are not included in the atomic model due 
to poor supporting electron density are for FLGK-A : 456- 
463, 486-490, 501-504, 580-591, 763-765; and for FLG-B : 
456-460, 501-504, 578-593, 646-651, 657-659, 762-765. 

The positions of the two AMP-PCP molecules (one per 
FGFRl molecule) were easily identified in 2F a-leo . 0 ^„ ) - 
Fc.ic.Mm. difference Fourier maps. The AMP-PCP molecule 
bound to FLGK-B is less tightly bound and. has been 
modeled with an occupancy of 0.5. 

Table A summarizes the X-ray crystallography data 
sets of FGFRl derivative crystals that were used to 
determine the structures of crystalline FGFRl and 
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crystalline FGF RL : AMP- PC? co-complex of the invent: ion . 



tapl e s 



Data Collection and MIR Phasing Summary 





Native 


AMP-PCP 


Thi-1' 


Thi-2 1 


PCMB* 


KAu(CNV 


X-ray source 


X-4A 


RU-200 


RU-200 


RU-200 


RU-200 


RU-200 


Resolution limit (A) 


2.0 


2.3 


2.6 


2.8 


2.8 


2.8 


Number of sites 


— 


— 


4 


7 


2 


2 


Cone. (mMVtime (h) 






0.1/24 


0.1/48 


0.2/2 


5.0/72 




4.8(19.7)* 


4.5(23.3) c * 


5.5 


9.8 


6.8 


6.8 


Total observations 


122569 


91324 


55456 


59488 


67088 


45303 


Unique reflections 


50771 


31997 


42820* 


35538 d 


18619 


18202 


Completeness (%) 


97.3(96.3)' 


95.5(93.7)' 


95.0 


96.7 


98.0 


97.7 


Signal (%i>3o) 


80.7(50.3) c 


79.6(5 1.7) e 


69.8 


66.8 


84.7 


77.6 


R iw e (%) 






17.1 


31.2 


15.4 


15.2 


Phasing power* 






1.8 


2.0 


1.0 


0.9 








0.55 


0.50 


0.81 


0.84 


Overall FOM h 










0.60 





■Thi-l, Thi-2; ethylmercurithiosalicylic acid (thimerosal); PCMB: 4-chloromercuribenzoic acid. 

lOOxSAHCh^Kh^l/SJ^fh) 
c Vaiue in parentheses is for the highest resolution shell. 

4 I(+h) and I(-h) processed as independent reflections. Anomalous scattering contributions were 
5 included. 

•R^ - 100 x 2, | IF^F^HFphOOI |/I,|F p (h)| f where F p and F w are the native and derivative 
structure factors, respectively. 

'Phasing power r.m.s. heavy atom structure factor / r.m.s. lack of closure (for acentric reflections 
from 20.0 to 2.8A). 

0 - 100 x E h | |F w (h)|-F We-te> (h)|/IJFp H (h>4:F p (h)| (for centric reflections from 20.0 to 2.8A). 

"Figure of merit: JP(4»exp(i$)d<fr/ JP(<t>)d(4>), where P is the probability distribution of the phase 
angle <t>. 
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For crystals comprising FGFRI and compounds I and 
2, data were collected on a Rigaku RU-200 rotating anode 
'.Cu Kco operating at 50 kv and 100 rA and equipped with 
double- focusing mirrors and an R- AXIS IIC image plate 
5 detector. One cryo-cooled crystal was used for each of 
the data sets. Crystals were soaked in a 
cryo-protectant [25% PEG 10000, 0.3 M (NH,),SO,, 5% 
ethylene glycol, 100 mM bis-Tris (pH 6.5), and 1 mM: 3- 
[ (3- (2-carboxyethyl) -4-methylpyrrol-5-yl) methylene] -2- 

10 ■ indolinone (hereafter referred to as compound 1) or 3- 
[4- (4-formylpiperazine-l-yl-) benzyl idenyl] - 2 - indolinone 
(hereafter referred to as compound 2) and flash-cooled 
in a dry nitrogen stream at -17S°C. Data were processed 
using DENZO and SCALE PACK. Otwinowski, 1993, 

15 Proceedings of the CCP4 Study Weekend (Daresbury, United 
Kingdom; SERC Daresbury Laboratory) pp 56-62. 

A summary of the data collection parameters are 
included in the following Table 6: 

20 TABLE 6 





Resolution 

limit (A) 


Observa- 
tions (N) 


Complete- 
ness (%) 


Redundan- 
cy 




Signal 
<I> ol) 


compound 

1 


2.5 


93535 


97.6 (96.1) 


2.7 


6.8 

(23.0) 


11.8 


compound 

2 


2.4 


94093 


99.1 (97.9) 


3.3 


6.3 

(322) 


11.4 



compound 1 structure: 550 residues, 252 water molecules, 2 compound 1 molecules (4589 atoms) 
compound 2 structure: 550 residues, 248 water molecules, 2 compound 2 molecules (4646 atoms) 



30 



Structure Analyses 

Atomic superpositions were performed with TOSS 
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( Hendr ickson , 1979;. Per residue solvent, accessible 
surface calculations were done with X-FLOR. The surface 
area buried in a diner interface was calculated with 
GRASP (Nicholls ec al . , 1991) using a probe radius of 
1.4 A. The stereochemical quality of the atomic model 
was monitored using PROCHSCK (Laskowski en al . , 1993, 
PROCHECK: a computer program to check the stereochemical 
quality of protein structures," J. Appl . Cryst. 26: 283- 
291) . As defined in PROCHECK, 93% of the residues in 
the model have mam-chain torsion angles in the most 
favored Ramachandran regions. There are no residues in 
disallowed regions, and three residues in generously 
allowed regions: Arg-622 in FLGK-A and FLGK-B and Arg- 
554 in FLGK-A. The overall G- factor score is 0.42. 

Table 7 summarizes the X-ray crystallography 
refinement parameters of the structures of crystalline 
FGFR1 and crystalline FGPRL: AMP-PCP co-complex of the 
invention. Table 8 summarizes the X-ray crystallography 
refinement parameters for the FGFR1/ compound complexes. 



TABLE 7 

Refinement Parameters 

FGFR1: 550 residues, 252 water molecules (4589 atoms) 

FGFR1: AMP-PCP: 550 residues, 238 water molecules, 2 AMP-PCP molecules (4638 atoms) 

Model d-spacings Reflection R-value* R.m.s.d. 

s 

(A) (N) (%) bonds (A) angles (°) B- values 6 

(A 2 ) 

FGFR1: 6.0-2.0 42548 21.3 (262? 0.008 1.3 16 
FGFR1: AMP-PCP: 6.0-2.3 26729 20.1 (27.5)' 0.009 1_4 1.7 

'R-value = 100 x S h | IF^h)) - jF^h)! | / 1*^001 for reflections with F o6t >2o. 
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5 For bonded protein atoms. 

: Value in parentheses is the free R-value (Brunger. 1993) determined from 5% of the data. 



TABLE 8 



Model 


d-spacings (A) 


Reflec- 
tions 


R- 

value' (N) 


bonds (A) 


angles (°) 


B- 

values' 
(A : ) 


compound 

1 


6.0-2.4 


42548 


19.7 
(27.0)' 


0.008 


1.3 


1.6 


compound 

*> 


6.0-2.5 


26729 


20.0 
(28.0)* 


0.008 


1.4 


1.7 



10 

■R^ » 100 x S>S, |I ; (h) - I(h)°| / S^S, h(h) 

c Value in parentheses is for the highest resolution shell. 

'R-value = 1 00 x S h j|F 0 (h)| - |F e (h)|| / S fc |F 0 (h)|, where F 0 and F e are the observed and caJcuiated 
structure factors, respectively (F 0 > 2s). 
15 'For bonded protein atoms. 

k Value in parentheses is the free R-vaiue determined from 5% of the data. 

Atomic Structural Coordinates 

Tables i and. 2 provide the atomic structural 

20 coordinates of unphosphorylated FGFR1 and 

unphosphorylated FGFR1 : AMP - PCP co-complex, respectively. 
In the Tables, coordinates for both of the FGFR1 
molecules of the dimer comprising the asymmetric unit 
are provided. The amino acid residue numbers coincide 

25 with those used in FIG. 3. In the first FGFR1 molecule 
of the dimer the residue number is preceded by a 1, 
i.e. . residue number 464 of the first FGFR1 molecule of 
the dimer is denoted by "1464". Tables 3 and 4 provide 
the atomic structural coordinates of FGFR1 in complex 

30 with indolinone compounds found to inhibit FGFR1 
function. 
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The following abbreviations are used in tne Tables: 
"Acorn Type" refers to the element whose coordinates 

are provided. The first letter in the column defines 

the element 
5 "A. A. " refers to amino acid. 

"X. Y and Z" provide the Cartesian coordinates of 

the element. 

"3" is a thermal factor that measures movement of 
the atom around its atomic center. 
10 " QCC " refers to occupancy, and represents the 

percentage of time the atom type occupies the particular 
coordinate. OCC values range from 0 to 1, with 1 being 
100%. 

" PRTl " or " PRT2 » relate to occupancy, with PRT1 
15 designating the coordinates of the atom when in the 

first conformation and PRT2 designating the coordinates 
of the atom when in the second or alternate 
conformation. 

Structural coordinates for FGFR1 may be modified by 
20 mathematical manipulation. Such manipulations include, 
but are not limited to, crystallographic permutations of 
the raw structure coordinates, f ractionalization of the 
raw structure coordinates, integer additions or 
subtractions to sets of the raw structure coordinates, 
2 5 inversion of the raw structure coordinates and any 
combination of the above. 

In addition, the structural coordinates can be 
slightly modified and still render nearly identical 
three dimensional structures. Therefore, a measure of a 
30 unique set of structural coordinates is the root-mean- 
square deviation of the resulting structure. Structural 
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coordinates chat render three dimensional structures 
that deviate from one another by a root -mean -square 
deviation of less than 1.5 A may be viewed as identical. 

E XAMPL E 2 : Computer-Based Design of Mo dulators 

PTK Function 

Potential modulators of PTK function were designed 
and identified by operating the program Catalyst on the 
structure of 3 - [ (3- (2-carboxyethyl) -4 -methylpyrrol-5- 
yl) methylene] -2- indolinone. The chemical features 
constraining the search model include a hydrogen bond 
donor, a hydrogen bond acceptor, and two hydrophobic 
points of contact. Approximately ao compounds were 
identified as potential modulators of PTK function using 
this method. 

The compounds identified by the method as potential 
modulators of PTK function were commercially available. 
These compounds were then tested for their ability to 
inhibit the FLK PTK in an enzyme linked immunosorbant 
assay (ELISA) . The method of performing this assay is 
taught in WO 96/40116, entitled "Indolinone Compounds 
for the Treatment of Disease," published on December 19, 
1996, invented by Tang et al., incorporated by reference 
herein in its entirety, including all figures, drawings, 
and -tables- Flk-1 specific antibodies cam be prepared 
from the following protocol: 

1. Prepare a Tresyl -Activated Agarose/Flk-l-D column 
by incubating 10 ml of Tresyl -Activated Agarose 
with 20 mg of purified GST-Flk-l-D fusion protein 
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in LOOmM sodium bicarbonate ipH 9.6) buffer 
overnight at 4°C. 

2. Wash tne column once with PBS. 

3. Slock the excess sites on the column with 2 M 
glycine for 2 hours at 4°C. 

4. Wash the column with PBS. 

5. Incubate the column with Rabbit anti-rlk-lD 
production bleed for 2 hours at 4°C. 

6. Wash the column with PBS . 

7. Elute antiserum with 100 mM Citric Acid, pH3 . 0 and 
neutralize the eluate immediately with 2 M Tns, pH 
9.0. 

8. Dialyize the eluate against PBS overnight at 4oC 
with 3 changes of buffer (sample to buffer ratio is 
1:100) . 

9. Adjust the dialyized antiserum to 5% glycerol and 
store at -80°C in small aliquotes. 

The Flk-1 ELISA can include a 2 , 2-azino-bis (3 - 
0 ethylbenz-thiazoline-6-sulfonic acid (ABTS) solution, 

which can comprise lOOmM citric acid (anhydrous) , 250 mM 
Na 2 HP0 4 (pH 4.0), 0.5 mg/ml ABTS (Sigma catalog no. A- 
1888) . The solution is most appropriately stored in 
dark at 4°C until ready for use. 
5 The FLK-1 specific antibodies can also be purchased 

from Santa Cruz Biotechnology (Catalog No. SC-504) . 

Four of the forty compounds identified as potential 
modulators of PTK function were potent modulators of FLK 
function. These molecules have the following 
0 structures: 
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N-N 



V r 

A 

\ 



V 



The modulators inhibit the FLK protein kinase with the 
following IC S0 values: 

TABLE 9 



10 



Compound 


FLK kinase 


FLK kinase 


EGFR 


IGF-1R 




ic w 


ic i0 


1C, 0 


ic* 




(MM) 


(MM) 


(MM) 






compounds 


compounds 








tested at IOOmM 


tested at 20*jM 






I 


14.8 


14 


>100 


>100 


2 


15.7 


10.6 


>I00 


>!00 




21.4 


16.6 


68 


30.9 


4 


22.9 


16.4 


>!00 


>I00 



15 



The invention illustratively described herein may 
be practiced in the absence of any element or elements, 
limitation or limitations which is not specifically 
disclosed herein. The terms and expressions which have 
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oeen employed are used as :erns of description and not 
of limitat icr. , and there is no in:er.:ior. that m the use 
of such terms and expressions of excluding any 
equivalents of the features shown and described or 
portions thereof, but it is recognized that various 
modifications are possible within the scope of the 
invention claimed. Thus, it should be understood that 
although the present invention has been specifically 
disclosed by preferred embodiments and optional 
features, modification and variation of the concepts 
herein disclosed may be resorted to by those skilled in 
the art, and that such modifications and variations are 
considered to be within the scope of this invention as 
defined by the appended claims. 

Those references not previously incorporated herein 
by reference, including both patent and non-patent 
references, are expressly incorporated herein by 
reference for all purposes. Other embodiments are 
within the following claims. 
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(1) APPLICANT: 



SUGEN . INCORPORATED 
351 Galveston Drive 
Redwood City, CA 94063 



TITLE OF INVENTION: 



CRYSTAL STRUCTURES OF A 
PROTEIN TYROSINE KINASE 



(iii) NUMBER OF SEQUENCES: 



(iv) CORRESPONDENCE ADDRESS: 



(A) 
(B) 

(C) 
(D) 
■(E) 
(F) 



ADDRESSEE: 
STREET : 

CITY: 
STATE : 
COUNTRY : 
ZIP: 



Lyon & Lyon 

633 West Fifth Street 

Suite 4700 

Los Angeles 

California 

U.S.A. 

90071-2066 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE : 



3.5" Diskette, 1.44 Mb 
storage 

IBM Compatible 
IBM P.C. DOS 5.0 
FastSEQ for Windows 2.0 



(vi) CURRENT APPLICATION DATA: 



(A) APPLICATION NUMBER: 
<B) FILING DATE: 
(C) CLASSIFICATION: 



To Be Assigned 
Herewith 



(vii) PRIOR APPLICATION DATA: 



(A) APPLICATION NUMBER: 

(B) FILING DATE: 
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ATTORNEY /AGENT INFORMATION: 

; A) N'AME : Warburg, Richard 

-3j REGISTRATION NUMBER: 3 2,32 7 

.Cj REFERENCE /DOCKET NUMBER: 22 7/033-PCT 



(ix) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: (213) 439-1600 

(Bi TELEFAX: (213) 955-0440 

(C) TELEX: 67-3510 



(2) INFORMATION FOR 3EQ ID NO : L : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 ammo acids 

{ 3 ) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ill) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Leu Ala Gly Val Ser Glu Tyr Glu Leu Pro Gl - Asp Pro Arg Trp 
1 5 10 IS 

Glu Leu Pro Arg Asp Arg Leu Val Leu Gly Lys Pro Leu Gly Glu Gly 
20 25 30 

Cys Phe Gly Gin Val Val Leu Ala Glu Ala He Gly Leu Asp Lys Asp 
35 40 45 

Lys Pro Asn Arg Val Thr Lys Val Ala Val Lys Met Leu Lys Ser Asp 
50 55 60 

Ala Thr Glu Lys Asp Leu Ser Asp Leu He Ser Glu Met Glu Met Met 
65 70 75 80 

Lys Met He Gly Lys His Lys Asn He He Asn Leu Leu Gly Ala Cys 
85 90 95 

Thr Gin Asp Gly Pro Leu Tyr Val He Val Glu Tyr Ala Ser Lys Gly 
100 105 HO 

Asn Leu Arg Glu Tyr Leu Gin Ala Arg Arg Pro Pro Gly Leu Glu Tyr 

115 120 125 

Cys Tyr Asn Pro Ser His Asn Pro Glu Glu Gin Leu Ser Ser Lys Asp 
130 135 14° 
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leu Val Ser Cys Ala Tyr Gin Val Ala Arg Gly Met Glu Tyr leu Ala 
145 150 155 * iso 



Lys Cys lie His Arg Asp leu Ala Ala Arg Asn Val Leu Val 
165 170 175 



Thr Glu As? Asn Val Met Lys lie Ala Asp Phe Gly leu Ala Arg As? 
ISO 185 190 

lie His His He Asp Tyr Tyr Lys Lys Thr Thr Asn Gly Arg Leu Pro 
195 200 205 

Val Lys Trp Met Ala Pro Glu Ala Leu Phe Asp Arg He Tyr Thr His 
210 215 220 

Gin Ser Asp Val Trp Ser Phe Gly Val Leu Leu Trp Glu He Phe Thr 
225 230 * 235 240 

Leu Gly Gly Ser Pro Tyr Pro Gly Val Pro Val Glu Glu Leu Phe Lys 
245 250 255 

Leu Leu Lys Glu Gly His Arg Met Asp Lys Pro Ser Asn Cys Thr Asn 
260 265 270 

Glu Leu Tyr Met Met Met Arg Asp Cys Trp His Ala Val Pro Ser Gin 
275 280 285 

Arg Pro Thr Phe Lys Gin Leu Val Glu Asp Leu Asp Arg lie Val Ala 
290 295 300 



Leu Thr Ser Asn Gin Glu 
305 310 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 
(iii) HYPOTHETICAL : 



315 amino acids 
amino acid 
single 
linear 

protein 

NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Ser Ala Ala Gly Thr Met Val Ala Gly Val Ser <31u Tyr Glu Leu Pro 
15 10 15 

Glu Asp Pro Arg Trp Glu Leu Pro Arg Asp Arg Leu Val Leu Gly Lys 
20 25 30 
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?rc Leu Gly Glu Gly Ala ?he Giy Gin Vai Val Leu Ala Giu Ala lie 
25 40 45 

Z i y Leu Asp Lys Asp Lys Pro Asn Arg Val Thr Lys Val Ala Val Lys 
5C 55 60 

Met Leu Lys Ser Asp Ala Thr Glu Lys Asp Leu Ser Asp Leu He Ser 

65 70 75 30 

Glu Met Glu Met Met Lys Met He Gly Lys His Lys Asn He He Asn 

35 90 95 

Leu Leu Gly Ala Cys Thr Gin Asp Gly Pro Leu Tyr Val He Val Glu 
100 105 110 

Tyr Ala Ser Lys Gly Asn Leu Arg Glu Tyr Leu Gin Ala Arg Arg Pro 
115 120 ' 125 

Pro Gly Leu Glu Tyr Ser Tyr Asn Pro Ser His Asn Pro Glu Glu Gin 
130 135 140 

Leu Ser Ser Lys Asp Leu Val Ser Cys Ala Tyr Gin Val Ala Arg Gly 
145 150 155 160 

Met Glu Tyr Leu Ala Ser Lys Lys Cys He His Arg Asp Leu Ala Ala 

165 170 175 

Arg Asn Val Leu Val Thr Glu Asp Asn Val Met Lys He Ala Asp Phe 
180 185 190 

Gly Leu Ala Arg Asp He His His He Asp Tyr Tyr Lys Lys Thr Thr 
195 200 205 

Asn Gly Arg Leu Pro Val Lys Trp Met Ala Pro Glu Ala Leu Phe Asp 
210 215 220 

Arg He Tyr Thr His Gin Ser Asp Val Trp Ser Phe Gly Val Leu Leu 
225 230 235 240 

Trp Glu He Phe Thr Leu Gly Gly Ser Pro Tyr Pro Gly Val Pro Val 
245 250 255 

Glu Glu Leu Phe Lys Leu Leu Lys Glu Gly His Arg Met Asp Lys Pro 
260 265 270 

Ser Asn Cys Thr Asn Glu Leu Tyr Met Met Met Arg Asp Cys Trp His 
275 280 285 

Ala Val Pro Ser Gin Arg Pro Thr Phe Lys Gin Leu Val Glu Asp Leu 

290 295 300 



Asp Arg He Val Ala Leu Thr Ser Asn Gin Glu 

305 310 315 
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.2; INFORMATION FCR SEC 10 NO : 3 : 

(li SEQUENCE CHARACTERISTICS : 

;A, LENGTH: 351 ammo acids 

^ TYPE: amine acid 

<C) S73ANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: procein 

(iii) HYPOTHETICAL : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Arg Gly Ser His His His His His His Gly Met Ala Ser Met Thr 
1 5 10 15 

Gly Gly Gin Gin Met Gly Arg Asp Leu Tyr Asp Asp Asp Asp Lys Asp 
20 25 30 

Pro Ser Ser Arg Ser Ala Ala Gly Thr Met Val Ala Gly Val Ser Glu 
35 40 45 

Tyr Glu Leu Pro Glu Asp Pro Arg Trp Glu Leu Pro Arg Asp Arg Leu 
50 55 60 

Val Leu Gly Lys Pro Leu Gly Glu Gly Ala Phe Gly Gin Val Val Leu 
65 ™ 75 go 

Ala Glu Ala He Gly Leu Asp Lys Asp Lys Pro Asn Arg Val Thr Lys 
85 90 95 

Val Ala Val Lys Met Leu Lys Ser Asp Ala Thr Glu Lys Asp Leu Ser 
100 1Q5 no 

Asp Leu He Ser Glu Met Glu Met Met Lys Met He Gly Lys His Lys 
115 120 125 

Asn He He Asn Leu Leu Gly Ala Cys Thr Gin Asp Gly Pro Leu Tyr 
130 135 140 

Val He Val Glu Tyr Ala Ser Lys Gly Asn Leu Arg Glu Tyr Leu Gin 
145 150 155 160 

Ala Arg Arg Pro Pro Gly Leu Glu Tyr Ser Tyr Asn Pro Ser His Asn 
165 170 175 

Pro Glu Glu Gin Leu Ser Ser Lys Asp Leu Val Ser Cys Ala Tyr Gin 
180 185 190 

val Ala Arg Gly Met Glu Tyr Leu Ala Ser Lys Lys Cys He His Arg 
1^5 200 205 
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Asp Leu Ala Ala Arg A sr. 7a! leu 7al Thr Glu Asp Asn Val Met Lys 

213 215 220 

:ie Ala Asp Phe Gly leu Ala Arg Asp He His His lie Asp Tyr Tyr 

225 230 235 240 



lys Lys Thr Thr Asn 
245 

Ala Leu Phe Asp Arg 
260 

Gly Val Leu Leu Trp 
275 

Gly Val Pro Val Glu 
290 

Met Asp Lys Pro Ser 

305 

Asp Cys Trp His Ala 
325 

Val Glu Asp Leu Asp 
340 



Gly Arg Leu Pro Val Lys 
250 

He Tyr Thr His Gin Ser 
265 

Glu He Phe Thr Leu Gly 
280 

Glu Leu Phe Lys Leu Leu 
295 

Asn Cys Thr Asn Glu Leu 
310 315 

Val Pro Ser Gin Arg Pro 

330 

Arg He Val Ala Leu Thr 
345 



Trp Met Ala Pro Glu 
255 

Asp Val Trp Ser Phe 
270 

Gly Ser Pro Tyr Pro 
285 

Lys Glu Gly His Arg 
300 

Tyr Met Met Met Arg 

320 

Thr Phe Lys Gin Leu 
335 

Ser Asn Gin Glu 
350 



(2) INFORMATION FOR SEQ ID NO: 4: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 base pairs 

<B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doub 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATGCTAGCAG GGGTCTCTGA GTATGAGCTT CCCGAAGACC CTCGCTGGGA GCTGCCTCGG SO 

GACAGACTGG TCTTAGGCAA ACCCCTGGGA GAGGGCTGCT TTGGGCAGGT GGTGTTGGCA 120 

GAGGCTATCG GGCTGGACAA GGACAAACCC AACCGTGTGA CCAAAGTGGC TGTGAAGATG 180 

TTGAAGTCGG ACGCAACAGA GAAAGACTTG TCAGACCTGA TCTCAGAAAT GGAGATGATG 24 0 

AAGATGATCG GGAAGCATAA GAATATCATC AACCTGCTGG GGGCCTGCAC GCAGGATGGT 300 

CCCTTGTATG TCATCGTGGA GTATGCCTCC AAGGGCAACC TGCGGGAGTA CCTGCAGGCC 360 

CGGAGGCCCC CAGGGCTGGA ATACTGCTAC AACCCCAGCC ACAACCCAGA GGAGCAGCTC 4 20 
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TCCTCCAAGG ACC7SG7G7C C7GCGCC7AC CAGG7GGCCC GAGGCATGGA G7A7C7GGCC 



430 



7CCAAGAAG7 GCA7ACACCG AGACC7GGCA GCCAGGAA7G 7CC7GG7GAC AGAGGACAA7 54 0 



600 



G7GA7GAAGA 7AGCAGACT7 7GGCC7CGCA CGGGACA77C ACCACA7CGA C7AC7A7AAA 

AAGACAACCA ACGGCCGAC7 GCC7G7GAAG 7GGA7GGCAC CCGAGGCA77 A777GACCGG 660 

A7C7ACACCC ACCAGAGTGA TGTGTGG7CT TTCGGGGTGC 7CC7G7GGGA GA7C77CAC7 720 

C7GGGCGGCT CCCCATACCC CGGTGTGCCT GTGGAGGAAC TTTTCAAGC7 GCTGAAGGAG 79 0 

GG7CACCGCA 7GGACAAGCC CAG7AACTGC ACCAACGAGC TGTACATGAT GATGCGGGAC 340 

TGC7GGCA7G CAGTGCCCTC ACAGAGACCC ACC77CAAGC AGCTGGTGGA AGACCTGGAC 900 

CGCATCG7GG CCTTGACC7C CAACCAGGAG 7AG 933 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATGCGGGG7T CTCATCATCA TCATCATCAT GGTATGGCTA GCATGACTGG 7GGACAGCAA 60 

ATGGGTCGGG ATCTGTACGA CGATGACGAT AAGGATCCGA GCTCGAGATC TGCAGCTGGT 120 

ACCATGGTAG CAGGGGTCTC TGAGTATGAG CTTCCCGAAG ACCCTCGCTG GGAGCTGCCT 180 

CGGGACAGAC TGGTCTTAGG CAAACCCCTG GGAGAGGGCG CCTTTGGGCA GGTGGTGTTG 240 

GCAGAGGCTA TCGGGCTGGA CAAGGACAAA CCCAACCGTG TGACCAAAGT GGCTGTGAAG 300 

ATGTTGAAGT CGGACGCAAC AGAGAAAGAC TTGTCAGACC TGATCTCAGA AATGGAGATG 360 

ATGAAGATGA TCGGGAAGCA TAAGAATATC ATCAACCTGC TGGGGGCCTG CACGCAGGAT 420 

GGTCCCTTGT ATGTCATCGT GGAGTATGCC TCCAAGGGCA ACCTGCGGGA GTACCTGCAG 480 

GCCCGGAGGC CCCCAGGGCT GGAATACTCC TACAACCCCA GCCACAACCC AGAGGAGCAG 540 

CTCTCCTCCA AGGACCTGGT GTCCTGCGCC TACCAGGTGG CCCGAGGCAT GGAGTATCTG 600 

GCCTCCAAGA AGTGCATACA CCGAGACCTG GCAGCCAGGA ATGTCCTGGT GACAGAGGAC 660 
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AA7GTGATGA AGA7AGCAGA C777GGCC7C GCACGGGACA 77CACCACA7 CGAC7AC7AT 72 0 

AAAAAGACAA CCAACGGCCG AC7GCC7G7G AAGTGGATGG CACCCGAGGC A77A777GAC 73 0 

CGGATC7ACA CCCACCAGAG TGATGTGTGG TCTTTCGGGG TGCTCCTGTG GGAGATCTTC 34 0 

AC7C7GGGCG GCTCCCCATA CCCCGGTGTG CCTGTGGAGG AACTTTTCAA GCTGCTGAAG 90 0 

GAGGGTCACC GCATGGACAA GCCCAG7AAC TGCACCAACG AGCTGTACAT GATGATGCGG 96 0 

GACTGCTGGC A7GCAG7GCC C7CACAGAGA CCCACCTTCA AGCAGCTGGT GGAAGACCTG 102 0 

GACCGCATCG TGGCCTTGAC CTCCAACCAG GAGTAG 10 5 6 
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A"om 

No. 



Atom 
Type 



A. A 



A. A 
No. 



ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 26 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 44 
ATOM 45 
ATOM 46 
ATOM 47 
ATOM 48 
ATOM 49 
ATOM 51 
ATOM 52 



3 
4 
5 
6 
7 
9 

1C 
11 
12 
13 
14 
15 
IS 
17 
18 
19 
20 
21 
22 
23 
25 



27 
28 
29 
30 
31 
32 
33 
35 
36 

37 

38 

39 

40 

41 

42 

43 



N* 

CA 
CB 
C 
O 
N 
CA 
CB 
CG 
CD1 
CD2 
C 
O 
N 

CD 
CA 
CB 
CG 
C 
O 
N 

CA 
CB 
CG 
CD 

OE1 

OE2 

C 

0 

N 

CA 

CB 

CG 

ODl 
002 
C 
0 
K 
CD 
CA 
CB 
CG 
C 
O 
N 

CA 
CB 



GL'J 
GLU 
GLU 
GLU 
GL'J 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 
GLU 
GLU 
GLU 
GLU 
GLU 
GLU 
GLU 
GLU 
GLU 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 
ARG 
ARG 
ARG 



1454 
1464 
1464 
1464 
1464 
1465 
1465 
1465 
1465 
1465 
1465 
1465 
1465 
1466 
1466 
1466 
1466 
1466 
1466 
1466 
1467 
1467 
1467 
1467 
1467 
1467 
1467 
1467 
1467 
1468 
1468 
1468 
1468 
1468 
1468 
1468 
1468 
1469 
1469 
1469 
1469 
1469 
1469 
1469 
1470 
1470 
1470 



-13 .639" 
-12 .479 
- 11 . 400 
-11.914 
-11.845 
-11.562 
-11.018 
-10.236 
-8.719 
-8.346 
-8.061 
-12.092 
-13.187 
-11.802 
-10.597 
-12.741 
-12 .110 
-10.629 
-12.846 
-11.847 
-14.060 
-14.268 
-15.744 
-16.375 
-17.819 
-18.446 
-18.314 
-13.838 
-13.899 
-13.299 
-12.883 
-11.384 
-10.985 
-11.833 
-9.817 
-13.252 
-12.481 
-14.43S 
-15.354 
-14.971 
-16.244 
-16.665 
-14.012 
'14.085 
-13.106 
-12.139 
'11.301 



16 . 975 
17. 105 
17. 974 
15. 738 
15 . 407 
14 . 925 
13 . 599 
13 .066 
13 .196 
14 .6S4 
12.671 
12.594 
12.590 
11.748 
11.793 
10.727 
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3.739 -5.723 30.919 1.00 53.59 

3.624 -7.535 26.930 1.00 35.40 

9.397 -3.119 27.753 1.00 36.57 
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10.557 -8.541 21.819 1.00 31.95 
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15.051 -6.440 19.245 1.00 29.58 
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10.479 -6.951 16.244 1.00 26.60 
10.758 -4.805 16.893 1.00 24.35 

9.745 -4.255 16.006 1.00 20.79 

8.495 -3.883 16.793 1.00 18.95 

7.723 -5.087 17.268 1.00 22.82 

6.442 -4.699 17.969 1.00 25.49 

5.560 -5.934 18.189 1.00 24.36 * 

4.892 -6.414 16.941 1.00 22.23 

10.254 -3.034 15.257 1.00 22.79 

10.613 -2.041 15.868 1.00 24.60 

10.259 -3.101 13.934 1.00 23.92 

10.707 -1.984 13.113 1.00 24.22 

10.92S -2.439 11.648 1.00 23.18 

11.270 -1.262 10.766 1.00 17.17 

12.068 -3.454 11.604 1.00 19.97 

11.975 -4.369 10.461 1.00 26.92 
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Whaz is claimed i 



/ « _ l m o_ a polypeptide 

corresponding to the catalytic domain of" a protein 
tyrosine kinase. 

2- The crystalline forrn of claim i, herein sa^ 
protein tyrosine kinase is a receptor protein tyrosine 
kinase . 



3- The crystalline form cf claim 2, wherein said 
receptor protein tyrosine kinase is selected from the 
group consisting of PDGF-R. flk, CCK4 , MET, TRKA, AXL, 
TIE, EPH, RYK, DDR, ROS, RET , LTK, R0R1 , and MUSK. 

4. The crystalline form of claim 1, wherein said 
protein tyrosine kinase is a non-receptor protein 
tyrosine kinase. 



5. The crystalline form of claim 4, wherein said 
non-receptor protein tyrosine kinase is selected from a 
group consisting of SRC, BRK, BTK, CSK, ABL, ZAP70, FES, 
FAK, JAK, and ACK. 

6. The crystalline form of claim 1, comprising 
one or more heavy metal atoms. 



7. The crystalline form 



of claim l, wherein said 
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protein tyrosine kinase is :u:R. 

3. The crystalline form of claim 7, wherein said 
FGrR is FGFR1. 

9. The crystalline form of claim 8, defined by 
atomic structural coordinates set forth in Table 1. 

10. The crystalline form of claim 7, comprising at 
least one compound. 

11. The crystalline form of claim 10, wherein said 
compound is a nucleotide analog. 

12. The crystalline form of claim 11, wherein said 
nucleotide analog is AMP-PCP. 

13. The crystalline form of claim 12, defined by 
atomic structural coordinates set forth in Table 2. 

14. The crystalline form of claim 10, wherein said 
compound is an indolinone compound. 

15. The crystalline form of claim 14, wherein said 
indolinone compound has a structure set forth in formula 
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or a pharmaceutical^ acceptable salt, isomer, 
metabolite, ester, amide, or prodrug thereof , 'wherein 

(a) A l( A,, A,, and A, are independently carbon or 
nitrogen; 

(b) R x is hydrogen or alkyl; 

(c) R 2 is oxygen in the case of an oxindolinone or 
sulfur in the case of a thiolindolinone; 

(d) R 3 is hydrogen; 

<e) R 4 , r S/ r s , ^ Rt are opcionally present and are 
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either (i) independent l.y selected from the group 
consisting of hydrogen, alkyl, alkoxy, aryl, aryloxy, 
alkaryl, alkaryloxy, halogen, trihaiomethyl , 3(0)R, 
S0 2 NRR\ S0 } R, 5R, N0 2 , NRR ' , OH, CN, C(0)R, 0C(0)R, 
NHC ( 0) R , (CH 2 ) n C0 2 R, and CONRR 1 or (ii) any two adjacent 
R«, R 5 , R«, and R 7 taken together form a fused ring with 
the aryl portion of the oxindole-based portion of the 
mdolinone; 

(f) R 2 ', R 3 ', R 4 ', Rs'r and R 6 ' are each 
independently selected from the group consisting of 
hydrogen, alkyl, alkoxy, aryl, aryloxy, alkaryl, 
alkaryloxy, halogen, trihaiomethyl, S(0)R, S0 2 NRR', S0 3 R, 
SR, N0 2 , NRR\ OH, CN, C(0)R, 0C(0)R, NHC(0)R, (CH 2 ) :1 C0 2 R, 
and CONRR • ; 

(g) n is 0, 1, 2, or 3 ; 

(h) R is hydrogen, alkyl or aryl; 

(i) R' is hydrogen, alkyl or aryl; and 

(j) A is a five membered heteroaryl ring selected 
from the group consisting of thiophene, pyrrole, 
pyrazole, imidazole, 1 , 2 , 3 - triazole , 1 , 2 , 4 - triazole , 
oxazole, isoxazole, thiazole, isothiazole, furan, 1,2,3- 
oxadiazole , 1,2,4- oxadiazole , 1,2,5- oxadiazole , 1,3,4- 
oxadiazole, 1,2,3,4 -oxatriazole , 1,2,3, 5 -oxatriazole , 

1.2.3- thiadiazole , 1,2,4- thiadiazole , 1,2,5- thiadiazole , 

1.3.4- thiadiazole, 1, 2, 3 , 4-thiatriazole, 1,2,3,5- 
thiatriazole, and tetrazole, optionally substituted at 
one or more positions with alkyl, alkoxy, aryl, aryloxy, 
alkaryl, alkaryloxy, halogen, trihaiomethyl, S(0)R, 
S0 2 NRR', S0 3 R, SR, N0 2 , NRR', OH, CN, C(0)R, 0C(0)R, 
NHC(0)R, (CH 2 ) n C0 2 R or CONRR'. 
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IS . 



a3 



boxyechyl J -4 - 



— -rys.a^ir.e rorm of ciai-n i- u 
-udoliaone compound is 3- [4.(4 f^ m , • 

y-)oen2yl ld enyi]-2-indonnone. 

13. The crystalline form of clain is 
i- ^ cxaim 16, defined by 

.... a,o mic structural coordinates of Table 3. 

19- The crystalline form of claim 17 * * ■ 
thP a iaim 17 ' defined by 

he at ° miC St ™= tural coordinates of Table 4. ■ 

20. The crystalline form of claim !, having 
monoclinic unit cells. 

21. The crystalline form of claim 20. wherein „■„ 
monoclinic unit cells hav- w *erexn sa ld 
A b-57 « A , dlmensi °^ of about a=208.3 

b=57 - 8 A ' =-65.5 A and 3-107.2°. 

22. The crystalline form of claim 20, where: 



-noclinic unit cells have dimens; ~" ^ 

c«66.i A and 3-107.7°, 



A." b.51.3 A, UAmenS1 ° nS <* -bout a=21i. 6 



23. The crystalline form of claim in 

„„„ claim 10, comprising 

one or more heavy metal atoms> P " 9 

24. A polypeptide corresponding to the catalytic 

d ° main " ' Pr ° tein C — ^ase, containing at t 
a^ut 20 amino acid residues upstream of the fLst 
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glycine in the conserved glycine -rich region of the 
catalytic domain, and at least about 17 amino acid 
residues downstream of the conserved argmme located at 
the C- terminal boundary of the catalytic domain. 

25. The polypeptide of claim 24, wherein said 
protein tyrosine kinase is a receptor protein tyrosine 
kinase . 

26. The polypeptide of claim 24, wherein said 
protein tyrosine kinase is a non-receptor protein 
tyrosine kinase. 

27. The polypeptide of claim 25, wherein said 
receptor tyrosine kinase is selected from the group 
consisting of FGF-R, PDGF-R, KDR, CCK4 , MET, TRKA, AXL, 
TIE , EPH, RYK, DDR, ROS , RET, LTK, R0R1 , and MUSK. 

28. The polypeptide of claim 26, wherein said non- 
receptor kinase is selected from the group consisting of 
SRC, BRK, BTK, CSK, ABL, ZAP70, FES , FAK, JAK, and ACK. 

29. The polypeptide of claim 24 having the amino 
acid sequence shown in SEQ ID NO:4. 

30. A method of using the polypeptide of claim 24 
to form a crystal, comprising the steps of: 

(a) mixing a volume of polypeptide solution 
with a reservoir solution; and 

(b) incubating the mixture obtained in step 
(a) over the reservoir solution in a closed container, 
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ur.cer cone: ; 



-Ons suitable 



:or crystaliizatior 



31. A method of obtaining an FGF receptor tyrosine 
kinase domain polypeptide in crystalline form, 
comprising the steps of: 

(a) mixing a volume of polypeptide solution 
with an equal volume of reservoir solution, wherein said 
polypeptide solution comprises 1 mg/mL to 60 mg/mL FGF- 
type tyrosine kinase domain protein, 10 mM to 200 mM 
buffering agent, 0 mM to 20 mM dithiothreitol and has a 
pH of about 5.5 to about 7.5, and wherein said reservoir 
solution comprises 10% to 30% (w/v) polyethylene glycol, 
0.1 M to 0.5 M ammonium sulfate, 0% to 20% (w/v) 
^ ethylene glycol or glycerol, 10 mM to 200 mM buffering 
agent and has a pH of about 5 . 5 to about 7.5 ; and 

(b) incubating the mixture obtained in step 
(a) over said reservoir solution in a closed container 
at a temperature between 0° and 25° °C until crystals 
form. 

32. The method of claim 31, wherein said 
polypeptide solution comprises about 10 mg/mL FGF 
receptor tyrosine kinase domain, about 10 mM sodium 
chloride, about 2 mM dithiothreitol, about 10 mM Tris- 
HC1 and has a pH of aSout 8 ; the reservoir buffer - 
comprises about 16% (w/v) polyethylene glycol (MW 
10000), about 0.3 M ammonium sulfate, about. 5% ethylene 
glycol or glycerol, about 100 mM bis-Tris and has a pH 
of about 6.5; and the temperature is about 4°C. 

33. The method of claim 31, wherein said 
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polypeptide solution comprises a compound. 

34 . A cDNA encoding an FGF receptor tyrosine 
kinase domain protein, wherein a coding strand of the 
cDNA has tne nucleotide sequence of SEQ ID NO : 5 . 

35. A method of determining three dimensional 
structures of protein tyrosine kinases with unknown 
structure comprising the step of applying structural 
atomic coordinates set forth in Table 1, Table 2, Table 
3 , or Table 4 . 

36. The method of claim 35, comprising the 

following steps: 

(a) aligning a first computer representation 
of an amino acid sequence of a protein tyrosine kinase 
of unknown structure with a second computer 
representation of a protein tyrosine kinase of known 
structure by matching homologous regions of amino acid 
sequences of said first computer representation and said 
second computer representation; 

(b) transferring computer representations of 
amino acid structures in said protein tyrosine kinase of 
known structure to computer representations of 
corresponding amino acid structures in said protein 
tyrosine kinase with unknown structure; and 

(c) determining a low energy conformation of 
the protein tyrosine kinase structure resulting from 
step (b) . 

37. The method of claim 35, comprising the 
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following steps: 

(a) aligning the positions of atoms in the 
unit cell by matching electron diffraction data from two 
crystals; and 

(b) determining a low energy conformation of 
the resulting protein tyrosine kinase structure. 

38. The method of claim 35, comprising the 
following steps: 

(a) determining the secondary structure of a 
protein tyrosine kinase structure using NMR data; and 

(b) simplifying the assignment of through- 
space interactions of amino acids. 

39. The method of any one of claims 35, 36/37, or 
38, wherein said protein tyrosine kinase with or without 
known structure is a receptor protein tyrosine kinase. 

40. The method of claim 39, wherein said receptor 
protein tyrosine kinase with or without known structure 
is selected from the group consisting of FGF-R, PDGF-R, 
FLK, CCK4, MET, TRKA, AXL, TIE, EPH, RYK, DDR, ROS, RET, 
LTK, ROR1, and MUSK. 

41. The method of anyone of claims 35, 36, 37, or 
38, wherein said protein tyrosine kinase with or without 
known structure is a non- receptor protein tyrosine 
kinase . 

42. The method of claim 41, wherein said protein 
tyrosine kinase with or without known structure is 
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selected from -he group consisting of SRC, BRK , 3TK, 
CSK, A3L , ZAP7Q, FES , FAX, J AX , and ACX. 

43. A method cf identifying a potential modulator 
of protein tyrosine kinase function by docking a 
computer representation of a structure of a compound 
with a computer representation of a structure of a 
cavity formed by the active-site of a protein tyrosine 
kinase, wherein said structure of said protein tyrosine 
kinase is defined by atomic structural coordinates set 
forth in Table 1, Table 2, Table 3, or Table 4. 

44. The method of claim 43, comprising the 
following steps: 

(a) removing a computer representation of a 
compound complexed with a protein tyrosine kinase and 
docking a computer representation of a compound from a 
computer data base with a computer representation of the 
active-site of the protein tyrosine kinase; 

(b) determining a conf ormation of the complex 
resulting from step (a) with a favorable geometric fit 
and favorable complementary interactions; and 

(c) identifying compounds that best fit said 
active-site as potential modulators of protein tyrosine 
kinase function. 

45. The method of claim 43, comprising the 
following steps: 

(a) modifying a computer representation of 
compound complexed with a protein tyrosine kinase by the 
deletion of a chemical group or groups or by the 
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addition of a chemical group or groucs ; 

(b) determining a conf orma: ion of the complex 
resulting from step (a) with a favorable geometric fit 
and favorable complementary interactions; and 

(c) identifying compounds that best fit the 
protein tyrosine kinase active-site as potential 
modulators of protein tyrosine kinase function. 

46. The method of claim 43, wherein said method 
comprises the following steps:' 

(a) removing a computer representation of a 
compound complexed with a protein tyrosine kinase; and 

(b) searching a data base for data base 
compounds similar to said compounds using a compound 
searching computer program or replacing portions of said 
compound with similar chemical structures from a data 
base using a compound construction computer program. 

47. The method of any one of claims 43, 44, 45, or 
46, wherein said protein tyrosine kinase is a receptor 
protein tyrosine kinase. 

48. The method of claim 47, wherein said receptor 
protein tyrosine kinase is selected from the group 
consisting of FGF-R, PDGF-R, FLK, CCK4, MET, TRKA, AXL, 
TIE, EPH, RYK, DDR, ROS, RET, LTK, R0R1, and MUSK. 

49. The method of anyone of claims 43, 44, 45, or 
46, wherein said protein tyrosine kinase is a non- 
receptor protein tyrosine kinase. 
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50. The method of claim 49, wherein said protein 
tyrosine kinase is selected from the group consisting of 
SRC, 3RK, 3TK, CSK, ABL , ZAP70 , FES , FAK, JAK , and ACK . 

51. a potential modulator of protein tyrosine 
kinase function identified by the method of any one of 
claims 43, 44, 45, or 46. 

52. The potential modulator of claim 51, wherein 
said modulator is selected from a computer data base. 

53. The potential modulator of claim 51, wherein 
said modulator is constructed from chemical groups 
selected from a computer data base. 

54. The potential modulator of protein tyrosine 
kinase function of claim 51, wherein said modulator is 
an indolinone compound of formula I or II: 
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or a pharmaceuticaliy acceptable salt, isomer, 
metabolite, ester, amide, or prodrug thereof, wherein 

(a) A x , A 2 , A 3 , and A< are independently carbon or 
nitrogen; 

(b) R x is hydrogen or alkyla- 
te) R 2 is oxygen in the case of an oxindolinone or 

sulfur in the case of a thiolindolinone; 

(d) R 3 is hydrogen; 

(e) R 4/ R 5/ r 6 , and R 7 are optionally present and are 
either (i) independently selected from the group 
consisting of hydrogen, alkyl, alkoxy, aryl, aryloxy, 
alkaryl, alkaryloxy, halogen, trihalomethyl, S(0)R, 
S0 2 NRR', S0 3 R, SR, N0 2 , NRR' , OH, CN, C(0)R, 0C<O)R, 
NHC(0)R, (CH 2 ) n C0 2 R, and CONRR' or (ii) any two adjacent 
R«, Rs/ R«# and R 7 taken together form a fused ring with 
the aryl portion of the oxindole -based portion of the 
indolinone; 

(f) R 3 ', R 3 ( , V, Rs', and R«' are each 
independently selected from the group consisting of 
hydrogen, alkyl, alkoxy, aryl, aryloxy, alkaryl, 
alkaryloxy, halogen, trihalomethyl, S(0)R, S0 2 NRR», S0 3 R, 
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3R, NC 2 , NRR', OH, CN , C(0)R, OC(0)R, NHC(0)R, (CH 2 ) n C0 2 R, 
and CONRR ' ; 

(g) n is 0, 1, 2, or 3; 

(h) R is hydrogen, alkyl or aryi ; 

(i) R» is hydrogen, alkyl or aryi; and 

(j) A is a five membered heteroaryl ring selected 
from the group consisting of thiophene, pyrrole, 
pyrazole, imidazole, 1,2, 3-triazole, 1, 2, 4-triazole, 
oxazole, isoxazole, thiazole, isothiazole, furan, 1,2,3- 
oxadiazole, 1, 2 , 4 -oxadiazole , 1,2, 5-oxadiazole, 1,3,4- 
oxadiazole, 1,2,3 , 4 -oxatriazole , 1,2,3, 5-oxatriazole , 

1. 2. 3- thiadiazole, 1, 2 , 4 - thiadiazole , 1 , 2 , 5- thiadiazole , 

1. 3. 4 - thiadiazole, 1,2, 3 , 4 - thiatriazole , 1,2,3,5- 
thiatriazole, and tetrazole, optionally substituted at 
one or more positions with alkyl, alkoxy, aryi, aryloxy, 
alkaryl, alkaryloxy, halogen, trihalomethyl , S(0)R, 
S0 2 NRR', S0 3 R, SR, N0 2 , NRR', OH, CN, C(0)R, 0C(0)R, 
NHC(0)R, (CH 2 ) n C0 2 R or CONRR' . 

55. A method of identifying a potential modulator 
of protein tyrosine kinase function as a modulator of 
protein tyrosine kinase function, comprising the 
following steps: 

(a) administering said potential modulator to 

cells; 

(b) comparing the level of protein tyrosine 
kinase phosphorylation between cells not administered 
the potential modulator and cells administered said 
potential modulator; and 

(c) identifying said potential modulator as a 
modulator of protein tyrosine kinase function based on 
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-he difference in the level of protein tyrosine kinase 
phosphorylation . 

55. A method of identifying a potential modulator 
of protein tyrosine kinase function as a modulator of 
protein tyrosine kinase function, wherein said method 
comprises the following steps: 

(a) administering a preparation of said 
potential modulator to cells; 

(b) comparing the rate of cell growth between 
cells not administered the modulator and cells 
administered the modulator; and 

(c) identifying said potential modulator as a 
modulator of protein tyrosine kinase function based on 
the difference in the rate of cell growth. 

57. A method of treating a disease associated with 
a protein tyrosine kinase with inappropriate activity in 
a cellular organism, wherein said method comprises the 
steps of: 

(a) administering a modulator of protein 
tyrosine kinase function to the organism, wherein said 
modulator is in an acceptable pharmaceutical 
preparation; and 

(b) - activating or inhibiting the protein 
tyrosine kinase function .to treat the disease. 



58. The method of any one of claims 55, 56, or 57, 
wherein said protein tyrosine kinase is a receptor 
protein tyrosine kinase. 
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59. The method of claim 53, wherein said receptor 
protein tyrosine kinase is selected from the group 
containing FGF-R , ?DG?-R, FLK, CCK4 , MET , TRKA, AXL , 
TIE , E?H, RYK , DDR, ROS , RET, LTK, ROR1, and MUSK. 

60. The method of any one of claims 55, 56, or 57, 
wherein said protein tyrosine kinase is a non-receptor 
protein tyrosine kinase. 

61. The method of claim '60, wherein said non- 
receptor protein tyrosine kinase is selected from a 
group consisting of SRC, BRK, BTK, CSK, ABL, ZAP70, FES, 
FAK, JAK, and ACK. 
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